REMARKS 


Claim Amendments 

The claims have been amended to clarify the claim language, place some previously 
dependent claims into independent form, and to correct the multiple dependent claims accordingly. 
Support for the addition of "comprises an iron-sulfur cluster" to Claims 2, 5, 21, 36 and 69-71 is 
found in Claim 1 as previously presented All remaining amendments are clerical in nature. 

Objection to the Specification and Rejection of Claims 1. 2, 4. 5. 7, 21. 24, 36.39. 44. 47. 48. 50-52, 
54-56. 58-65. and 68-71 Under 35 U.S.C. § 1 12, First Paragraph (Written Description) : 

The Examiner has objected to the specification and rejected Claims 1, 2, 4, 5, 7, 21, 24, 
36,39, 44, 47, 48, 50-52, 54-56, 58-65, and 68-71 under 35 U.S.C. § 112, first paragraph, on the 
basis of written description. Specifically, the Examiner states that Applicants' arguments regarding 
the various structural features of the claimed oxygenase (e.g., free iron-binding domains) that were 
known in the art at the time of the invention to be conserved among all oxygenases of this same type 
and to be required for function of the enzyme are not persuasive, because the Examiner contends that 
the presence of free iron-binding domains in a monoxygenase does not describe a dicamba-degrading 
oxygenase of the present invention, but rather describe a genus beyond dicamba-degrading 
oxygenases. The Examiner states that an invention may not be adequately described when it is 
described solely in terms of a method of making it coupled with its function and there is no described 
or art-recognized correlation or relationship between the structure of the invention and its function. 
With regard to Applicants' prior arguments that the art recognizes that within the Rieske non-heme 
iron-binding family of oxygenases, the location of the substrate binding region is known to be in 
proximity to the site of the free iron atom that is a catalyst for the reaction (with the binding site for 
the iron atom being conserved among this type of oxygenase), the Examiner argues that this is not 
persuasive because Applicant does not describe the substrate binding region of the claimed 
oxygenase. The Examiner further dismisses Applicants' arguments that sequence similarity with 
other monoxygenases of the same type that degrade substrates other than dicamba provides 
significant information regarding the structure-to-function relationship of the enzyme. With regard 
to the Weeks Declaration of 2004, the Examiner states that while this data might provide evidence 
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that one of skill in the art could isolate other dicamba-degrading oxygenases from other bacteria, it 
does not overcome the issue of written description. 

Applicants traverse the Examiner's rejection. Based on the Examiner's comments in the May 
28 Office Action, it appears to Applicants as though the Examiner's position rests on the contention 
that Applicants have not demonstrated where in the oxygenase sequence the substrate, dicamba, 
binds, and that there was insufficient knowledge in the art at the time of the invention for one of skill 
in the art to modify up to 35% of the amino acid sequence of the enzyme or produce a fragment 
without destroying the substrate specificity of the enzyme. The Examiner also appears to completely 
dismiss any other structure-to-function knowledge regarding this type of oxygenase that was known 
in the art at the time of the invention and concludes that there is "no described or art-recognized 
correlation or relationship between the structure of the invention and its function." Applicants 
strongly disagree with the Examiner's position for the following reasons. 

First, contrary to the Examiner's conclusions, Applicants have clearly demonstrated that, at 
the time of the present invention, there was a substantial amount of art-recognized correlation and 
relationship between the structure of Rieske non-heme iron-binding oxygenases that transfer 
electrons through iron-sulfur clusters and free iron atoms and the function of such enzymes. As 
previously stated, the oxygenase of the present invention falls within this class of oxygenases. In 
particular, examination of the biochemical and physical properties of oxygenase Dlc , as well as its 
primary amino acid sequence, revealed that it contains two essential elements, a Rieske iron-sulfur 
cluster (see the specification at page 40, line 27 to page 41, line 1, wherein this iron-sulfur cluster 
is first identified) and a mononuclear iron binding site (page 52, lines 1 5- 1 8, for example, discloses 
the requirement for iron (Fe 2+ ) for activity). These specific structural regions, which have been 
further illustrated and discussed in the Weeks December 2002 Declaration in comparison to other 
oxygenases of the same or similar class that were known at the time of the invention (see Figure A), 
account for the majority of the functional capability of the claimed oxygenase and the identifying 
motifs for such structures were known in the art at the time of the invention. 

Indeed, at the time of the invention, numerous examples of multicomponent oxygenases were 
documented in the literature, including those containing terminal oxygenases of the Rieske non-heme 
iron oxygenase type of the present invention. The art knew the consensus binding sites for the 
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Rieske iron-sulfur clusters, and binding sites for the free iron in Rieske non-heme iron oxygenases 
were widely known (See, e.g., Mason J.R, and Cammack R. ( 1 992 )Annu Rev Microbiol. 46:277-305 
(full text article enclosed); or Butler, C. S. and Mason, J. R. (1997) Advances in Microbial 
Physiology 38, 47-84 (full text article enclosed); Neidle et al. (1991) J. Bacteriol. 173(17):5385- 
5395 (full text article enclosed); Junker et al. (1997) J. Bacteriol. 179(3):919-927 (full text article 
enclosed); Gurbiel et al. (1996) Biochemistry 35:7834-7845 (abstract enclosed); Harayama et al. 
(1992) Annu Rev Microbiol. 46:565-601 (abstract enclosed)). Further, these binding sites in 
oxygenase DIC are illustrated in Figure A of the Weeks Declaration submitted December 2002, and 
could be readily identified by one of skill in the art. Again, these are significant portions of the 
structure of the oxygenase that are required for the functionality of the oxygenase Dlc , and it is 
information that one of skill in the art at the time of the invention would absolutely use to determine 
where in the protein modifications will be best tolerated. Therefore, this information is relevant to 
the issue of whether one knows what modifications to the claimed enzyme will affect biological 
activity. 

Furthermore, at the time of the invention, it was widely known by biochemists, molecular 
biologists, and many other chemists and biologists that any class of enzymes catalyzing a specific 
reaction dealing with a specific substrate (or set of specific required substrates) can be carried out 
with members of this class of enzymes that have variable amino acid sequences. As previously 
argued by Applicants, it was known from a multitude of published results that scientists could readily 
alter the amino acid sequence at several sites within any given enzyme without causing lose of 
enzymatic activity. This was known to be true for amino acid residues near or even within the 
active site domain or substrate binding sitefsl of an enzvme . In support of this position, Applicants 
refer to the prior discussion in the Weeks Declaration of December 2002 and the example 
referencing variants of cytochrome oxidases. 

It was also widely known at the time of the invention that members of a specific class of 
enzymes frequently contained motifs (domains) for binding ligands required for the reaction (NADH, 
ATP, iron-sulfur clusters, free iron, etc.) or for the proper structure and/or function of the enzyme. 
Such motifs usually are defined not by a precise amino acid sequence, but rather by a consensus 
amino acid sequence in which certain key amino acid residues (or close homologs) are usually 
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present and often are located similar distances apart. An example is the consensus sequence for the 
binding of a [2Fe-2S] cluster that comprises a Rieske iron-sulfur cluster, found in the oxygenase of 
the present invention. Note that the consensus sequence is C-X-H-X I6 -C-X 2 -H, where X can be any 
amino acid. However, there are cases known in which the number of amino acids between the 
normally invariant cysteine and histidine residues are different than the consensus above (See e.g., 
Mason J.R, and Cammack R. (1992) Annu Rev Microbiol. 46:277-305, article enclosed). That is to 
say, while consensus sequences for the known motifs can be easily recognized within an oxygenase 
of the type represented by the present invention, the exact amino acid sequence within a bona fide 
consensus sequence can be quite variable. It was also known at the time of the invention that there 
was a consensus binding site for free iron (i.e., a non-heme Fe(II) domain with the consensus 
sequence D/E-X 3 -D-X 2 -H-X 4 -H). Thus, the same logic put forth for variability in the amino acid 
sequences comprising the Rieske non-heme iron consensus sequence pertains to the consensus 
sequence for iron binding sites. Similarly, and of particular importance to the Examiner's apparent 
focus on the substrate binding site of the enzyme, it also was known at the time of the invention that 
within a substrate-binding site, there are only a few amino acids that form direct contacts with the 
substrate (See e.g., the chapter on Enzymes in: Nelson, D.L. and Cox, M.M. (2005) Lehninger 
Principles of Biochemistry, 4 th Edition, W.H. Freeman and Co., NY). Other surrounding amino acids 
often can be changed without significantly affecting the ability of the enzyme to bind the substrate 
and to catalyze the usual reaction involving the substrate. This is especially true of oxygenases, 
where the range of substrates is often quite large, an indication that the substrate binding region lacks 
significant geometric specificity (See e.g., ., Mason J.R, and Cammack R. (1992) Annu Rev 
Microbiol. 46:277-305; or Butler, C. S. and Mason, J. R. (1997) Advances in Microbial Physiology 
38, 47-84). 

Therefore, Applicants submit that the Examiner simply can not discount these structural 
features and conclude that there is no art-recognized correlation or relationship between the structure 
of the invention and its function. Such a conclusion is, on its face, completely incorrect and 
unsubstantiated by the art at the time of the invention. 

Second, Applicants submit that evidence has already been provided that, at the time of the 
present invention, one of skill in the art would have sufficient knowledge regarding where the 
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substrate for the oxygenase would bind, so that such person would easily be able to avoid disrupting 
this site, even without knowing exactly which residues contact substrate, and still modify up to 35% 
of the structure of the enzyme without destroying function. More specifically, as noted above and 
as previously stated, at the time of the invention, it was known that the binding site for free iron was 
situated in immediate proximity to the portion of the substrate that is modified in the chemical 
reaction catalyzed by the oxygenase DIC (i.e., the O-methyl group of dicamba) (see, e.g., Mason J.R, 
and Cammack R. (1992) Annu Rev Microbiol. 46:277-305; Junker, F., Kiewitz, R., and Cook, A.M. 
(1997) 179:919-927). 

The Examiner seems to assert in the May 28 Office Action that this argument is irrelevant 
because Applicants' specification did not specifically identify the sequence that binds dicamba. 
However, such an assertion is groundless, because Applicants have already demonstrated that, at the 
time of the invention, one can readily identify the free iron binding site in the claimed oxygenase 
(because the identifying consensus sequence or motif was well known) and therefore, one can 
certainly identify residues that are in close proximity to this binding site. The skilled in the art who 
wished to retain enzymatic activity while changing amino acid sequences of the enzyme would 
simply avoid residues in the immediate vicinity of the free iron-binding site. Importantly, because 
the essential free iron binding site residues and the substrate binding site comprise only a limited 
portion of any Rieske non-heme iron oxygenase molecule, there would easily be available to one 
skilled in the art a very large number of amino acid sequences (easily up to 35%) that could be 
altered without significantly affecting the enzymatic activity of the enzyme, and without knowing 
the exact sequence of the substrate binding site. Referring again to the sequence of the oxygenase 
shown in Figure A, which represents less than half of the 340 amino acids in the full sequence of the 
oxygenase, one can see that the active sites are just a small portion of the total sequence. 

Applicants are not claiming any sequence of unrestricted structural features, but rather a 
sequence that is at least 65% identical to SEQ ID NO:4. Referring again to the December 2002 
Declaration of Dr. Weeks and to the additional papers presented herein, it is clear that one of skill 
in the art had ample knowledge to predictably modify up to 35% of the protein and retain enzyme 
activity. Therefore, it is clear that the present inventors were in possession of the claimed invention 
at the time of filing. 
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Third, Applicants have provided an abundance of evidence generated post-filing that simply 
substantiates Applicants' position that one of skill in the art could, given the description in the 
specification and the knowledge in the art at the time of the invention, envision the full scope of the 
oxygenase sequences as claimed, and then readily make and use such enzymes. This evidence 
supports Applicants' position that the invention was sufficiently described or supported by the 
knowledge in the art at the time the invention was filed. Applicants will not reiterate these data 
again here, but will remind the Examiner that Applicants have demonstrated: the identification, 
isolation and sequencing of dicamba-degrading oxygenases from three additional bacterial strains 
(having 90. 1 -99% identity to the claimed SEQ ID NO:4); the mutation of sites within the oxygenase 
while retaining biological activity; the comparison of the modeled tertiary structure of the 
oxygenase DIC with the known structure of naphthalene dioxygenase. 

Finally, the Examiner appears to acknowledge the post-filing evidence that one of skill in the 
art could isolate other dicamba-degrading oxygenases from other bacteria, and Applicants submit 
that, given that these sequences were obtained using the disclosed sequence of SEQ ID NO: 3 and 
that the specification teaches that naturally occurring oxygenase sequences from other dicamba- 
degrading bacteria exist and will have structures similar to that of SEQ ID NO: 3 and further taught 
methods to obtain and identify such sequences (e.g., see page 10, line 19 to page 11, line 4), this 
evidence demonstrates that Applicants were in possession of the invention as claimed in Claim 1 or 
2 and related claims at the time of the invention. 

In view of the foregoing remarks, Applicants respectfully request that the Examiner withdraw 
the rejection of Claims 1, 2, 4, 5, 7, 21, 24, 36,39, 44, 47, 48, 50-52, 54-56, 58-65, and 68-71 under 
35 U.S.C. § 1 12, first paragraph. 

Objection to the Specification and Rejection of Claims 1. 2. 4, 5. 7. 21. 24. 36.39. 44. 47. 48. 50-52, 
54-56. 58-65, and 68-71 Under 35 U.S.C. $ 1 12. First Paragraph (Enablemenf) : 

The Examiner has objected to the specification and rejected Claims 1, 2, 4, 5, 7, 21, 24, 
36,39, 44, 47, 48, 50-52, 54-56, 58-65, and 68-71 under 35 U.S.C. § 112, first paragraph, on the 
basis of enablement. Specifically, in response to Applicants' last arguments, the Examiner contends 
that the structure of an enzyme is integral to its function, but that Applicants have only provided 
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guidance to make and use an isolated DNA molecule encoding SEQ ID NO:4. The Examiner states 
that the "art teaches the cytochrome P450 enzymes interact with other proteins, in particular 
reductases that donate an electron during the reaction, and are associated with membrane structures 
within the cell" and concludes that one must also take into consideration structures of a P450 enzyme 
that are critical to association with other proteins involved in the transfer of electrons and 
associations within the membrane. The Examiner does not find Applicants' data regarding other 
bacterial dicamba-degrading oxygenases to be persuasive because they are greater than 90% identical 
to SEQ ID NO:4. Finally, the Examiner contends that Applicants have not addressed the issue of 
fragments claimed in Claim 2. 

Applicants traverse the Examiner's rejection. Initially, Applicants again note, as stated in the 
last response, the oxygenase of the present invention is not in the family of cytochrome P450 
oxygenases that use a heme group to transfer electrons . Rather, the oxygenase described in the 
present invention is a member of the Rieske non-heme iron-binding oxygenases that transfer 
electrons through iron-sulfur clusters and free iron atoms (see specification, Examples, March 2004 
response). Applicants have clearly demonstrated, referring to the discussion above under written 
description, that the specification teaches that both the iron-sulfur clusters and the free iron binding 
site are found in the claimed oxygenase and that at the time of the invention, one of skill in the art 
would readily identify both of these structures in the claimed oxygenase (see Weeks Declaration of 
December 2002 and Figure A). Therefore, contrary to the Examiner's contention, the specification 
and the guidance available in the art at the time of the invention provides sufficient guidance to one 
of skill in the art regarding the structures of the claimed Rieske non-heme iron-binding oxygenase 
that are required for enzyme activity. With regard to substrate binding, Applicants further note that 
it was known at the time of the invention that the binding site for free iron in a Rieske non-heme 
iron-binding oxygenase is situated in immediate proximity to the portion of the substrate that is 
modified in the chemical reaction catalyzed by the oxygenase DIC (i.e., the O-methyl group of 
dicamba). Therefore, one of skill in the art would also know where in the protein the substrate 
binding region will be located. 

As discussed previously, the specification and the art teach one of skill in the art how to make 
modifications to a nucleotide sequence and/or protein sequence, and based on the discussion 
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provided above, one of skill in the art would clearly be able to modify up to 35% of the protein 
sequence without undue experimentation while predictably maintaining protein function. Moreover, 
the specification provides assays by which dicamba-degrading catalytic activity can easily be 
evaluated. Applicants' post-filing data have demonstrated that one of skill in the art can easily 
produce mutations that do not disrupt protein function. Furthermore, post-filing evidence regarding 
the established substrate binding sites for a few Rieske non-heme iron-binding oxygenases indicate 
that the art was correct regarding the location of the substrate binding site being proximal to the free 
iron binding site (e.g., see Karlsson et al., (2003) Science 299: 1039-42 (cited in Weeks Declaration 
of 2004); see also Kauppi et al., (1998) Structure 6:571-586; and Carredano et al., (2000) J. Mol. 
Biol. 296:701-712). 

Applicants have also established that, as predicted by the specification, one can use the 
sequence for the oxygenase of the present invention and readily identify, isolate and clone additional 
dicamba-degrading oxygenases that fall within the scope of Claim 1 or Claim 2. The fact that the 
identified dicamba-degrading oxygenases from other bacteria are greater than 90% identical to SEQ 
ID NO:4 does not discount their value in demonstrating the ease in obtaining an oxygenase within 
the scope of Claim 1, nor even an oxygenase within the scope of Claim 2. Applicants additional 
arguments provided above demonstrate additional enablement support for the full breath of the 
present claims. 

With regard to biologically active fragments of SEQ ID NO:4, Applicants submit that the 
same arguments as set forth above apply to fragments of SEQ ID NO:4. Given the knowledge of 
where in the protein lie the structures required for function of the claimed oxygenase as described 
above, one of skill in the art would readily be able to produce truncated versions of the oxygenase 
that retain enzymatic activity. It is particularly noted that the iron-sulfur cluster and the free iron 
binding site (and by extrapolation, the substrate binding site) are not located at the C- or the N- 
terminus of the protein (see Figure A of the December 2002 Weeks Declaration). Therefore, one of 
skill in the art would be able to predictably produce fragments of SEQ ID NO:4 that would retain 
enzymatic activity. 
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In view of the foregoing remarks, Applicants respectfully request that the Examiner withdraw 
the rejection of Claims 1, 2, 4, 5, 7, 21, 24, 36,39, 44, 47, 48, 50-52, 54-56, 58-65, and 68-71 under 
35 U.S.C. § 1 12, first paragraph. 

Applicants have attempted to respond to all of the Examiner's concerns as set forth in the 
May 28 Office Action, and submit that the claims are in a condition for allowance. Any further 
concerns regarding these claims can be discussed with the below-named agent at (303) 863-9700. 


Respectfully submitted, 
SHERIDAN ROSS P.C. 


Date: 


'jb iUMbLk o? 1. dO O 1 ) 



Angehf Dallas Sebor 
Registration No. 42,460 
1560 Broadway, Suite 1200 
Denver, Colorado 80202-5141 
(303) 863-9700 
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Abstract 

The degradation of aromatic compounds by aerobic bacteria frequently begins 
with the dihydroxylation of the substrate by nonheme iron-containing di- 
oxygenases. These enzymes consist of two or three soluble proteins that 
interact to form an electron-transport chain that transfers electrons from 
reduced nucleotides (NADH) via flavin and [2Fe-2S] redox centers to a 
terminal dioxygenase. The dioxygenases may be classified in terms of the 
number of constituent components and the nature of the redox centers. Class I 
consists of two-component enzymes in which the first protein is a reductase 
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containing both a flavin and a [2Fe-2S] redox center and the second com- 
ponent is the oxygenase; Class II consists of three-component enzymes in 
which the flavin and [2Fe*2S] redox centers of the reductase are on a separate 
flavoprotein and ferredoxin, respectively; and Class III consists of three- 
component enzymes in which the reductase contains both a flavin and [2Fe- 
2S] redox center but also requires a second [2Fe-2S] center on a ferredoxin for 
electron transfer to the terminal oxygenase. Further subdivision is based on 
the the type of flavin (FMN or FAD) in the reductase, the coordination of the 
[2Fe-2S] center in the ferredoxin, and the number of terminal oxygenase 
subunits. 

From the deduced amino acid sequence of several dioxygenases the ligands 
involved in the coordination of the nucleotides, iron-sulfur centers, and 
mononuclear nonheme iron active site are proposed. On the basis of their 
spectroscopic properties and unusually high redox potentials, the [2Fe-2S] 
clusters of the ferredoxins and terminal oxygenases have been assigned to the 
class of Rieske-type iron-sulfur proteins. The iron atoms in the Rieske 
iron-sulfur cluster are coordinated to the protein by two histidine nitrogens 
and two cysteine sulfurs. 

INTRODUCTION 

The benzene nucleus is one of the most abundant chemical structures in the 
biosphere, originating from both natural and anthropogenic sources. The 
biodegradation of compounds containing this structure is carried out almost 
exclusively by microorganisms (26), and with few exceptions, the initial step 
is the introduction of two hydroxyl groups into the benzene ring. This action 
is a prerequisite for further fission and catabolism of the aromatic compound. 
The enzymes that perform this specialized function of catalyzing the insertion 
of molecular oxygen into the organic substrate are termed oxygenases. 

This review describes the properties and structure of a class of di- 
oxygenases that convert aromatic rings to cw-diols. The enzymes generally 
comprise several protein subunits, an iron center, iron-sulfur clusters, and a 
flavin, arranged in an electron-transfer chain. The structures of these com- 
ponents vary in several ways. The structural evidence derives from very 
limited X-ray crystallographic results, extensive amino-acid sequence data 
and comparison with other electron-transfer proteins, and the application of 
spectroscopic methods. 

Oxygenases 

The most extensively investigated oxygenase enzymes are those isolated from 
Pseudomonas species (54). They activate molecular oxygen and incorporate it 
directly into the chemical structure of the organic molecule. Although the 
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reactions catalyzed by oxygenases are highly exergonic, the free energy 
released is not conserved by the formation of ATP. Thermodynamically, the 
effect of the oxygenase reaction is to destabilize the aromatic growth substrate 
and render the catabolism of the aromatic compounds irreversible (22), 
The various oxygenases differ in structure, mechanism, and in coenzyme 
requirements. They can, however, be classified into two groups: the mono- 
oxygenases, which incorporate one atom of oxygen into one molecule of 
substrate, and the dioxygenases, which add both atoms of oxygen into the 
substrate. In monooxygenases, the other atom of, oxygen is reduced to water 
so that these enzymes function as part oxygenase and part oxidase; hence, 
they are also termed mixed-function oxidases (73). The distinction between 
dioxygenase and monooxygenase activities is, however, not always absolute. 
For example, both toluene and naphthalene dioxygenases can catalyze 
monooxygenase reactions as in the oxidation of indene and indan to 1-indenol 
and 1-indanol, respectively (112), while 4-methoxy benzoate monooxygenase 
dihydroxylates the vinylic side chain of 4-methoxystyrene with both atoms 
derived from molecular oxygen (114, 115). 

Dioxygenases are involved at several stages in the pathways for the catabol- 
ism of aromatic compounds and may be considered to fall into two groups: 
1. Dioxygenases involved in ring hydroxylation. These all require reduced 
cofactors, either NADH or NADPH, in addition to oxygen. They di- 
hydroxylate aromatic substrates to produce cw-diols, e.g. toluene di- 
oxygenase: 



dilrydrodiol 


2. Dioxygenases involved in ring fission. These have no cofactor require- 
ment and cleave the benzene ring of hydroxylated (di- or tri-) aromatic 
substrates, e.g. catechol 1,2-dioxygenase: 


OH 


OH 



COOH 

COOH 


catechol cts, cts-muconic add 


This review deals with the former group of dioxygenases, which are 
essential for the activation of aromatic compounds. 
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STRUCTURE OF RING-HYDROXYLATING 
DIOXYGENASES 

Gibson and coworkers (40, 41) were the first to show that the dioxygenase 
system of Pseudomonas putida oxidized benzene to generate c/j- 1,2- 
dihydroxy-cyclohexa-3, 5-diene. Several aromatic ring-hydroxylating di- 
oxygenases have since been isolated and characterized (2, 30, 49, 50, 101, 
102, 120, 123). The enzymes are nonheme dioxygenases that oxidize the 
aromatic substrates to give czs-dihydrodiols or cw-diol carboxylic acids. They 
require, in addition to molecular oxygen, cofactors, including reduced pyri- 
dine nucleotides and Fe(II), for the reactions they catalyze. 

In the ring-hydroxylating oxygenases so far investigated, the preferred 
reductant is NADH. All are soluble, multicomponent enzymatic systems 
comprising the same types of redox components (Table 1). The details of the 
electron-transfer chains have not been elucidated in all cases; however, a 
typical arrangement is a short electron-transfer chain: 


NADH — > Flavin — * Fe-Spd Fe-SR — > Fe — ► O 2 

Reductase site Electron-transfer chain Oxygenase site 


Here, Fe-Spd and Fe-SR represent ferredoxin-type and Rieske-type [2Fe- 
2S] iron-sulfur clusters respectively, as defined below. Fe represents the 
iron-binding site proposed to be the terminal oxygenase center. 

The distribution of the redox components, and the number and size of 
protein subunits, depend on the type of the ring-hydroxylating dioxygenases 
being considered (Table 1). All of the terminal dioxygenase proteins of this 
class consist of an iron-sulfur protein with a Rieske-type [2Fe-2S] cluster and 
an iron site, but the reductase chain, which transfers reducing equivalents 
from NADH to the terminal dioxygenase, may consist of either one or two 
separate proteins. 

Reductase System 

Table 1 shows that the reductase chains of different ring-hydroxylating di- 
oxygenases occur in three different arrangements, depending on the number 
of subunits and the disposition of iron-sulfur clusters. They may be further 
subdivided on the basis of the types of iron-sulfur clusters and flavin present: 

1 . Two-component dioxygenases, in which the flavin and iron-sulfur cluster 
are combined in the same protein. Examples include benzoate 1,2- 
dioxygenase (EC 1.13.99.2) (119), phthalate dioxygenase (EC 1.14.12.7) 
(5), 4-chlorophenylacetate-3,4-dioxygenase (71, 95), and 4-sulfo- 
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The classes of dioxygenases are as defined by Batie et al (4). 
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benzoate-3 ,4-dioxygenase (69). These are Class I dioxygenases in the 
classification of Batie et al (4) and are subdivided into Class IA, contain- 
ing FMN, and IB, containing FAD. 

2. Three-component dioxygenases, in which the electron-transfer chain com- 
prises a flavoprotein and a separate iron-sulfur protein (ferredoxin). Class 
IIA comprises pyrazon dioxygenase (EC 1.14.12), which has a plant-type 
ferredoxin as judged by its electron paramagnetic resonance (EPR) spec- 
trum (93). Class IIB includes benzene dioxygenase (EC 1 .14. 12.3) (2, 39) 
and toluene dioxygenase (101, 102) which, as discussed later, appear to 
have Rieske-type iron-sulfur clusters in their ferredoxins. 

3. Three-component dioxygenases, in which the electron-transfer chain com- 
prises both an iron-sulfur flavoprotein and a ferredoxin. The sole represen- 
tative, so far, of Class III is naphthalene dioxygenase (49, 50). 


The differences between the dioxygenases may be related to function; the 
products of the three-component dioxygenases are all cis-dihydrodiols, 
whereas those of the two-component dioxygenases are either cis-diol 
carboxylic acids or catechols. In the latter case (e.g. 4-sulfobenzoate-3,4- 
dioxygenase), a hypothetical dihydrodiol intermediate represents a highly 
unstable configuration that spontaneously loses a group (sulfite or chloride) 
with the concomitant energetically favorable rearomatization. 


related systems The electron-transfer chains of the aromatic di- 
oxygenases may be considered in the wider context of mixed-function ox- 
idases. This review refers to other oxygenases that have similar electron- 
transfer chains from NAD(P)H. 

For example, 4-methoxybenzoate monooxygenase from P. putida (DSM 
No. 1868) is a closely related system that has a similar reductase chain. It 
does not catalyze dioxygenation of the aromatic ring (9), but as already noted, 
it has dioxygenase activity towards certain substrates (115). It comprises a 
nonheme iron-containing terminal oxygenase, which Bernhardt and cowork- 
ers (8, 109) named putidamonooxin, and a reductase that contains FMN and a 
[2Fe-2S] cluster. The vanillate demethylase of Pseudomonas strain ATCC 
19151 catalyzes a similar monooxygenase reaction and consists of two similar 
components, the sequences of which were recently determined (16). 

Another similar oxygenase is the cytochrome P-450 monooxygenase, in 
which the terminal oxygenase component is heme (83). All P-450 systems 
incorporate a pyridine nucleotide-linked flavoprotein, and some, but not all, 
involve ferredoxins with [2Fe-2S] clusters. They are typified by the well- 
studied camphor monooxygenase of P. putida (strain CIB), which comprises 
a flavoprotein (putidaredoxin reductase), a ferredoxin (trivial name puti- 
daredoxin), and the terminal monooxygenase P-450cam (28, 98). Overall, 
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more is known about the mechanism of oxygen activation in the P-450 
monooxygenases than in any other hydroxylase. 

Finally, the third similar oxygenase is methane monooxygenase, in which 
the terminal oxygenase component is a dimeric nonheme iron cluster, prob- 
ably with fx - oxo bridging ligands (34, 85, 118). This has a NADH-linked 
reductase comprising an iron-sulfur flavoprotein (70, 86). 

We now consider the various components of the aromatic ring- 
hydroxy lating dioxygenases, and what is known about their composition, 
catalytic activity and structure. 

simple flavoproteins In the three-component dioxygenase systems, the 
flavoprotein, the first component of the electron-transfer chain, acts as an 
oxidoreductase catalyzing the transfer of two electrons from NADH. The 
flavoprotein consists of a single polypeptide with relative molecular mass (M r ) 
ranging from 42 to 67 kilodaltons (kDa) (Table 2). The protein is reportedly 
monomeric for toluene and pyrazon dioxygenases (93, 101). It contains FAD 
as the only prosthetic group, which is readily lost during purification (50, 
101). These flavoproteins belong to a large family of FAD-containing pyri- 
dine nucleotide oxidoreductases. They vary in size, substrate specificities, 
and preference for NADH or NADPH (51). 

The reductase components of the benzene dioxygenase and toluene di- 
oxygenase contain FAD as their prosthetic groups (2, 101). The flavoproteins 
show absorption spectra typical of FAD with maxima at around 375 and 450 
nm. They function as oxidoreductases by transferring electrons from NADH 
to the ferredoxin components. Both enzymes are specific for NADH. By 
contrast, the flavoprotein of naphthalene dioxygenase (50) is significantly 
reduced when NADPH is substituted for NADH as the coenzyme. 

The probable cofactor binding sites in the reductase proteins may be 
inferred from their amino acid sequences and comparison with other flavopro- 
teins. NADH-dependent reductases of dioxygenases may be compared with 
the FMN-containing putidaredoxin reductase, which performs an analogous 
function in the cytochrome P-4 50-dependent camphor hydroxylase (63, 91). 
Other NADH-dependent flavoproteins found in Pseudomonas sp. include 
dihydrolipoamide dehydrogenase [P. fluorescens (7)] and the flavin- 
containing monooxygenases, p-hydroxyphenylacetate hydroxylase (87) and 
salicylate hydroxylase (61). Examples of NADPH-dependent flavoproteins 
include mercuric reductase [P. aeruginosa (15)], glutathione reductase [P. 
aeruginosa (N. L. Brown, cited in 96)], and p-hydroxybenzoate hydroxylase 
[P . fluorescens (56)]. (Note that the above-mentioned flavoprotein monoox- 
ygenases contain no metal centers.) 

The proteins are each expected to contain two binding sites for ADP 
moieties, one in FAD and the other in NAD. For convenience, these sites are 
referred to as the FAD-binding and NAD-binding sites. Common to the 
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FAD-binding (ADP-binding) site of the flavoprotein oxidoreductases is the 
highly conserved sequence 

Gly-X-Gly-X-X-Gly-X-X-X-Ala-X-X-X-X-X-X-Gly, 

where X represents any amino acid, situated near the amino terminus of the 
protein (7,51, 99). The Gly-X-Gly-X-X-Gly sequence constitutes a tight turn 
between the first strand of a /3-shcct and the succeeding a-hclix (117). The 
NAD-binding (ADP-binding) domain of many enzymes has also been re- 
ported to be a /3a/3-fold centered around a highly conserved Gly-X-Gly-X-X- 
Gly sequence (7, 96). A comparison of the primary structure of reductase B ED 
and reductase-roL with the amino acid sequence of other flavoprotein ox- 
idoreductases reveals two such regions as shown in Figure 1. The first 
sequence of Gly-X-Gly-X-X-Gly-X-X-X-Gly-X-X-X-X-X-X-Gly, which is 
conserved in the dioxygenase flavoproteins examined, is located close to the 
amino terminus of the protein and is most probably the FAD-binding site of 
the reductase component of the dioxygenase. 

Some 140 residues along the protein from the first conserved Gly, one finds 
another di nucleotide-binding consensus sequence, although with significant 
differences (Figure 2). First, the conserved alanine residue in the dioxygenase 
reductase is replaced by a glycine in the other oxidoreductases. Second, a 
glutamate residue is always seven residues farther along the protein from the 
last conserved glycine. This negatively charged residue at the carboxy end of 


Enzyme Amino acid sequence 


ReductaseBED 
ReductaseTOL 
LipoamidcDH 
PutidarodoxinR 
p-OH benzoateH 
Consensus sequence 

Secondary structure 


' tlRNHUR I I GNGUAGFTTRQRLRAEGYEGR I SL I GEEQHLP 

* MRTHUR 1 1 GNGUGGFTTRQRLRREGFEGR I SL I GDEPHLP 

2 QKFDUUU I GRGPGGVUflfl I RARQLGLKTRC I EKY I GKEGK 

3RNDHUU I UGTGLflGUEURFGLRRSGUEGM I RIUGDRUJ I P 

2 l1KTQUfl 1 1 GRGPSGLLLGQLLHKRG I DNU I LERQTPDYUL 

G G G R G 
G 

PPPPPPTaaoaaaaaaaaaaT tpppppp 


Figure 1 Alignment of the putative FAD-binding sites of reductase B eo and reductase T oL with 
those of other Pseudomonas oxidoreductases. The numbers indicate the position of the amino 
acid in the protein sequence. In the secondary structure, a, /3, and T indicate a-helix, /3-sheet, 
and turn, respectively. The sequences presented are: reductase TOL (127); lipoamide de- 
hydrogenase (7); putidaredoxin reductase (90), and p-hydroxybenzoate hydroxylase (113). 
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the second /3-strand is involved in binding the 2* -OH of the ribose of NADH 
(96). In NADPH-dependent oxidoreductases, this residue is replaced by an 
uncharged amino acid, presumably to accommodate the 2' -phosphate group 
on the coenzyme. The other difference in the NADP-binding domain is the 
replacement of the third glycine residue of the conserved trio by alanine, with 
another alanine four residues farther along the helix. The substitution of 
alanine for glycine is thought to modify the local polypeptide conformation 
associated with the difference in coenzyme specificity (96). These differences 
in the amino acid sequence could explain why NADPH cannot replace NADH 
as the coenzyme in the benzene and toluene dioxygenase systems (2, 101). 
The other salient feature of the two conserved regions in all these enzymes is 
their similar distance from each other, which could indicate similar overall 
folding of the polypeptide. 


iron-sulfur flavoproteins In contrast to the two-component reductase 
chains in which the iron-sulfur cluster resides on a separate polypeptide, 
dioxygenases have been isolated in which this redox center is arranged on the 
same polypeptide as the flavoprotein. Table 2 lists the properties of these 
iron-sulfur flavoprotein reductases. There are two classes, depending on 
whether they contain FAD or FMN. They each contain a [2Fe-2S] cluster. 

Although they fulfill the same role as the two-component electron-transfer 


Enzyme 


Amino acid sequence 


ReductaseBED 

ReductaserOL 

Li poamide dehydrogenase 
P. fluorescens 

A. vindandii 

E. coti 

Consensus sequence 
Secondary structure 


M, TPHTRLLIUGGGLI CCEUaTTflRKLCLSUTI LERGDELLU 
M, T$RTRLL I UGGGL I GUEUBTTRRKLCLSUT I LERGDELLU 

t76 AUPKKLGU I GRGUI GLELGSUURRLGREUTULERLDKFLP 

178 NUPGKLGU I GRGUI GLELGSUURRLGflEUTULEflflDKFLP 

,78 EUPERLLUf1GGG 1 1 GLEtlGTUVHALGSQ I DUUEnFDQU I P 

G G G R G E 
G 

ppppppTaaaaaaaaaaaaaTTpppppp 


Figure 2 Alignment of the putative NAD-binding sites of reductase B ED and reductasexoL with 
those of other oxidoreductases. The numbers indicate the position of the amino acid in the protein 
sequence. In the secondary structure, a, /3, and T indicate a-helix, /3-sheet, and turn, respective- 
ly. The sequences presented are: reductase BED (59), reductase TO L (127), and lipoamide de- 
hydrogenase [P. fluorescens (7), Azotobacter vinelandii (116), £. coli (99)]. 
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systems, these “true” reductases show little homology with the former. Thus, 
the reductase component of the benzoate 1,2-dioxygenase, although show- 
ing a high degree of immuno-cross-reactivity with the toluate dioxygenase, 
showed no such homology with the components of the benzene or naph- 
thalene dioxygenases (75). 

The recently determined structure of phthalate dioxygenase reductase (4) 
shows that the protein comprises three domains. The N-terminal domain, 
containing FMN, has the same structural topology as the plant ferredoxin: 
NADP reductase (62), consisting of a six-stranded /3-barrel. The intermediate 
NAD-binding domain has the typical Rossman all 8 fold of nucleotide-binding 
proteins. The C-terminal domain resembles the plant-type ferredoxins from 
cyanobacteria (35, 92, 108). The sequences of the iron-sulfur flavoproteins of 
the benzoate and toluate dioxygenases (79) show homologies with the plant- 
type ferredoxins, as do the reductases of vanillate demethylase (16), xylene 
monooxygenase (103), and methane monooxygenase (97). 

The iron-sulfur flavoproteins can transfer electrons directly from NADH to 
an artificial electron acceptor. Thus, the reductase components of benzoate 
1,2-dioxygenase, phthalate dioxygenase, and 4-chlorophenylacetate 3,4- 
dioxygenase reduce cytochrome c directly in the presence of NADH (5, 95, 
119, 123). These proteins are monomeric with M t of 34-38 kDa and contain a 
[2Fe-2S] cluster in addition to the flavin prosthetic group. Consequently, the 
absorption spectra of the oxidized proteins show absorption maxima at 340 
and 460 nm. The EPR spectra of the reduced proteins show typical features of 
plant-type [2Fe-2S] clusters, together with a signal around g = 2.004 from the 
flavosemiquinone radical (5, 95). 

Yamaguchi & Fujisawa (119) studied the iron-sulfur flavoprotein (de- 
signated NADH-cytochrome c reductase) of the benzoate 1 ,2-dioxygenase of 
P. arvilla. They succeeded in removing the [2Fe-2S] cluster by treatment with 
/7-chloromercuriphenyl-sulfonate and in reconstituting it by incubation with 
iron, sulfide, and mercaptoethanol (121). This technique has been suc- 
cessfully applied to ferredoxins (e.g. 107a), but it is unusual to achieve 
reconstitution with an enzyme of this complexity. On removal of the [2Fe-2S] 
cluster, the protein retained some ability to transfer electrons from NADH to 
artificial acceptors such as ferricyanide (60%), but not to the dioxygenase 
protein (less than 1%). Full activity was restored when the [2Fe-2S] cluster 
was replaced. 

Ferredoxin NA p reductase resembles the reductases described above in that it 
can catalyze the reduction of cytochrome c independently. The absorption 
spectrum is also similar to the spectra of the iron-sulfur flavoproteins with 
absorption maxima at 340, 420, and 460 nm, and a broad shoulder at 540 nm. 
Unlike these proteins, however, ferredoxin NAP reductase requires an addition- 
al protein, ferredoxin NAP to transfer electrons to the terminal dioxygenase 
component (49). 
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ferredoxins The intermediate electron-transfer carrier in the aromatic 
dioxygenases is a ferredoxin containing an iron-sulfur cluster of the [2Fe-2S] 
type. Ferredoxins are small proteins that contain iron-sulfur clusters, whose 
sole function is to transfer electrons (17). 

Several different types of [2Fe-2S] clusters are known in proteins. The 
largest class consists of the plant-type ferredoxins, of which the first known 
example was spinach ferredoxin, found in chloroplast membranes. Similar 
proteins have been found in cyanobacteria, and the structures of three of these 
have been determined (35, 92, 108). Ferredoxins with similar properties are 
involved in electron transfer to certain cytochrome P-450-dependent monoox- 
ygenases; these include putidaredoxin from P. putida camphor monooxyge- 
nase, and adrenodoxin of the steroid monooxygenase of mammalian adrenal 
mitochondria. NMR studies indicate that the structures of these proteins have 
a similar fold (45, 82). All of these are small hydrophilic proteins containing 
one [2Fe-2SJ cluster per molecule, serving as one-electron carriers, the 
[2Fe-2S] clusters contain two atoms of iron, bridged by two labile sulfide 
atoms, arid coordinated to the protein by four sulfide ligands contributed by 
four cysteine residues. They are reddish in color; the visible absorption 
principally results from sulfur-to-Fe(III) charge-transfer transitions (81). 
Upon reduction, one of the iron ions is reduced to Fe(II), and the visible 
absorption decreases to about half. 

57 Fe Mossbauer spectroscopy enables the investigator to distinguish the 
individual iron atoms in the iron-sulfur clusters. In reduced ferredoxins, the 
spectra of Fe(III) and Fe(Il) ions can be clearly identified as quadrupole 
doublets with chemical shifts characteristic of their valence states (20, 27, 
78). The electronic structure of the [2Fe-2S] clusters is such that the spins on 
the iron ions are antiferromagnetically coupled. As a result, the oxidized 
protein has a ground state with zero spin, and there is no electron paramagne- 
tic resonance (EPR) spectrum. Upon reduction, one of the iron ions becomes 
high-spin Fe(II), and the net spin of the coupled Fe(III) and Fe(II) ions is S = 
1/2. Thus, unlike most iron compounds, the [2Fe-2S] clusters give an EPR 
signal in the reduced state, when measured at low temperatures, which has an 
average g-factor of less than 2 (44). In the plant-type ferredoxins g av is around 
1.96. 


Plant-type ferredoxin 


Cys-S ,.S. S-Cys 


Fe' 


Fe: 


3 + 


Cys-S 


/ \ 


S-Cys 


le' 




Cys-S S-Cys 

Fe** 

Cys-S^ ^ S-Cys 


Oxidized 


Reduced 
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Another class of proteins that contain [2Fe-2S] clusters comprises the 
Rieske proteins. They are named after I. S. Rieske, who described a protein 
isolated from the cytochrome bc { complex of mitochondria (89). Similar 
proteins are associated with the b 6 -f complex of the thylakoid membrane of 
chloroplasts (58, 88) and the plasma membrane of some aerobic and 
photosynthetic bacteria (32, 36); for a recent review on microbial be com- 
plexes, see the report by Trumpower (106). Among the distinguishing proper- 
ties of the Rieske iron-sulfur clusters are red-shifted visible spectra with 
maximum absorbance about two-thirds those of the plant-type ferredoxins and 
relatively positive midpoint redox potentials. In the EPR spectra of the 
reduced proteins, g av is lower, around 1.91, with a characteristic sharp 
derivative peak at g = 2.02 and a broad trough around g =* 1.80. The 
plant-type and Rieske-type [2Fe-2S] clusters can be distinguished using sever- 
al different spectroscopic methods, including optical absorption, EPR, Moss- 
bauer, resonance Raman, and X-ray absorption. As explained below, there is 
evidence that two of the ligands to the Rieske iron-sulfur cluster are the 
nitrogens of histidine residues (33, 48). 

The ferredoxins of the ring-hydroxylase dioxygenase systems are small 
acidic proteins of M r ranging from 12 to 15 kDa, containing 2 gram atoms of 
iron and acid-labile sulfide per mole (Table 3). The ferredoxins are one- 
electron acceptors (25, 102). In addition to transferring electrons from the 
flavoprotein to the terminal oxygenase component, the ferredoxin catalyzes a 
flavoprotein-dependent reduction of other electron acceptors such as 
cytochrome c (2, 102). 

Gibson and coworkers found that analogous electron-transfer proteins, such 
as spinach ferredoxin, putidaredoxin, adrenodoxin, and the 2[4Fe-4S] ferre- 
doxin from Clostridium pasteurianum, cannot replace ferredoxin TO L in either 
cytochrome c reduction or toluene oxidation (43, 102). Similarly, ferredoxin- 
tol could not substitute for ferredoxin NA p in the naphthalene dioxygenase 
system (49). Thus, the ferredoxins are apparently rather specific for the 
dioxygenase system, in which they act as electron-transfer proteins. This 
specificity is reflected in the primary structure of the proteins. 

The ferredoxins of the different ring-hydroxylating dioxygenases are sim- 
ilar in size: ferredoxin NAP comprises 104 amino acids, while ferredoxin B ED 
and ferredoxin TO L have three amino acids more. Ferredoxin B ED and ferredox- 
inTOL are almost identical in sequence, but ferredoxin NAP reportedly shares 
only 34% homology with ferredoxin BED in amino acid composition (67). In 
all other [2Fe-2S] ferredoxins [with the exception of a [2Fe-2S] protein of C. 
pasteurianum, which appears to represent yet another family of iron-sulfur 
proteins (74)1, the cysteine residues involved in binding the cluster are 
arranged as 


Cys-X-X-X-Cys-X-X-Cys-29 amino acids-Cys 
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Dioxygenase Compone nt group A m „ (nm) EPR g x ,g y ,g 2 potential (mV) References 

Benzene B (Ferredoxin BE D) [2Fe-2S] 280,320,456 1.834,1.890,2.026 -155 25,77 

Toluene Ferredoxin-roL [2Fe-2S] 277,327,460 1.81,1.86,2.01 -109 102 

Naphthalene Ferredoxin NAP [2Fe-2S] 280, 325, 460 49 

Pyrazon B [2Fe-2S] 411, 453 1.94, 2.02 93 
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where X denotes any amino acid (125). The arrangement of the cysteines in 
the dioxygenase ferredoxins is quite different, however. Figure 3 shows an 
alignment of the conserved regions in the four sequences determined so far. 
Because the ferredoxin serves as an intermediate electron carrier in these 
dioxygenases, it is conceivable that some of the conserved regions in the 
proteins represent domains involved in recognition of the flavoprotein and 
terminal oxygenase components. However, examination of the conserved 
regions shows that they include three cysteine and two histidine residues, 
suggesting that these include ligands involved in binding to the [2Fe-2S] 
cluster, in which case the ligation is different from that in the plant-type 
ferredoxins. 

The EPR spectrum of the ferredoxin of the pyrazon dioxygenase resembles 
those of the P-450 monooxygenases such as putidaredoxin with g-factors of 
2.02 and 1.94 (93), indicating that it is a typical [2Fe-2S] cluster with cysteine 
ligands. By contrast, the fact that ferredoxins of the toluene and benzene 
dioxygenases have unusual [2Fe-2S] clusters is shown by their spectroscopic 
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Figure 3 Conserved cysteines and histidines in the putative cluster-binding sequences of 
Rieske-type [2Fe-2S] clusters in ferredoxins, dioxygenases, and Rieske proteins. Ferredoxins: 
ND, naphthalene dioxygenase from F. putida 9816 (67); BED1, benzene dioxygenase from P. 
putida ML2 (77); BED2, benzene dioxygenase from P. putida BE81 (59); TOD, toluene 
dioxygenase from P. putida FI (127); CYPETC, Rieske iron-sulfur protein (PetC) from the 
cyanobacterium Nostoc PCC 7906; SCRIP 1, Rieske iron-sulfur protein 1 from Saccharomyces 
cerevisiae (6); NCUCR, ubiquinol-cytochrome c reductase from Neurospora crassa (55). 
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and redox properties (Table 3). Evidently, they have features that differ from 
the plant and other bacterial [2Fe-2S] ferredoxins. These include the absence 
of the typical doublet peak in the 410- to 463-nm region of the absorption 
spectrum exhibited by other well-studied [2Fe-2S] ferredoxins (81). The 
molar absorbance at A max = 460 nm is only about 70% of the value for 
plant-type ferredoxins at A max — 415 nm (102), and is nearly equivalent to 
those for Rieske iron-sulfur proteins (33). The midpoint redox potentials of 
the ferredoxins of benzene (ferredoxin B ED) and toluene dioxygenase (ferre- 
doxin TOL ) are —155 mV and —105 mV, respectively (39, 102). This is 
significantly more positive than spinach ferredoxin [—420 mV (104)], adre- 
nodoxin [—270 mV (57)], or putidaredoxin [—239 mV (46)], but more 
negative than the Rieske clusters, which range from -140 to +350 mV (19, 
65). Furthermore, the EPR spectra of reduced ferredoxin B ED and ferredox- 
in XO L have average g-factors of 1.92 and 1.84, respectively. These may be 
compared with g av = 1.96 for most plant-type ferredoxins (11), and g av = 
1.91 for the Rieske-type proteins (32, 48). These properties appear more 
similar to those of the Rieske-type iron-sulfur proteins (32, 33) than the 
plant-type [2Fe-2S] ferredoxins. 

The Catalytic Terminal Oxygenase Component 

The terminal oxygenase, also an iron-sulfur protein, is the catalytic com- 
ponent of the dioxygenase enzyme. For hydroxylation of the aromatic sub- 
strate, both O 2 and Fe 2+ must be present. Thus, in addition to a substrate 
binding site, the terminal dioxygenase component also contains an iron- 
binding site and a Rieske-type [2Fe-2S] cluster (5, 39, 73). 

The terminal dioxygenase components are large oligomeric proteins of M r 
150-200 kDa (Table 4). The majority of them have two dissimilar subunits, a 
and (5 of about 50 and 20 kDa, respectively. They are organized either as a 2 ^2 
for benzene, toluene, and naphthalene dioxygenases (42, 93, 126) or a 3 /3 3 for 
benzoate dioxygenase (122). The exceptions are the terminal dioxygenase 
components of 4-sulfobenzoate 3,4-dioxygenase (69), 4-chlorophenylacetate 
3,4-dioxygenase (72), and phthalate dioxygenase (5), which comprise two, 
three, and four identical monomers, respectively. All the terminal di- 
oxygenase components have [2Fe-2S] clusters of the Rieske-type with one 
[2Fe-2S] cluster per ajS dimer or a monomer. For the terminal dioxygenase 
components with an a(3 dimeric configuration, research has indicated that the 
a subunit contains the [2Fe-2SJ cluster [benzoate 1,2-dioxygenase (122)], 
while the /3 subunit has been implicated as the subunit involved with substrate 
recognition [toluate 1,2-dioxygenase (53)]. 

iron center of the terminal oxygenase Several aromatic di- 
oxygenases require Fe(II) for activity. The iron is presumed to bind to a 
specific site in the protein, where it may be oxidized to Fe(IU). The iron- 
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binding site is the least well understood component, so any discussion of its 
structure and function will necessarily be speculative. A center capable of 
single-electron transfer is required for efficient activation of the O 2 , which has 
a triplet ground state. Although all the terminal dioxygenases contain Rieske- 
type clusters, there is no precedent for a [2Fe-2S] cluster serving as an 
oxygen-binding site; the sulfide is prone to oxidative damage. Analogy with 
other types of oxygenases indicates that oxygen activation occurs at the iron 
site. Several classes of oxygenase involve an iron site without an NADH- 
linked reductase chain. An example is protocatechuate dioxygenase from P. 
aeruginosa ; the structure of this molecule has been determined using X-ray 
crystallography (80). The protein ligands to the iron were found to be two 
tyrosines and two histidines. Spectroscopy experiments revealed that tyrosine 
is a ligand in other nonheme iron-containing dioxygenases, including 4- 
hydroxyphenylpyruvate dioxygenase (14). If the iron represents the site at 
which O 2 reacts, it is presumed to be close to the substrate-binding site. 

Amino acid sequences of the terminal oxidases of benzene, toluene, benzo- 
ate, toluate, and naphthalene dioxygenases all contain, in addition to the 
characteristic Rieske sequence near the N terminus (Figure 4), two conserved 
histidines and two conserved tyrosines near the middle of the sequence (59, 
67, 79, 127). This site represents a likely binding domain for the iron center. 
Interestingly, the iron-containing monooxygenases for alkanes (64) and 
xylene (103) also have a homologous region rich in histidines and tyrosines, 
which is dissimilar from those of the dioxygenases. The iron-binding site 
presumably dictates the type of oxygenation reaction that takes place. 

One should be able to observe this reaction using EPR, though for some 
types of ligand field, the reaction may become too broad to be detected. No 
EPR signals due to high-spin Fe(III) have yet been reported for the aromatic 
dioxygenases. The comparable iron site in 4-methoxybenzoate monooxyge- 
nase has been examined using EPR and Mossbauer spectroscopy (12, 13, 
109). In the oxidized protein, the iron center is expected to be high-spin 
Fe(III). A distribution of a complex series of EPR signals was around g = 6.0 
and 4.3, corresponding to high-spin Fe(IlI) in a tetragonal and rhombic 
symmetry, respectively. The g = 6.0 signals shifted upon addition of sub- 
strates; the g = 4.3 did not. A comparison of enzyme reoxidized by oxygen 
with enzyme oxidized anaerobically demonstrated that these signals do not 
correspond to Fe^II^CV substrate complexes. 

The mechanism of the dioxygenase reaction remains to be elucidated. A 
proposal by Twilfer et al (110) envisaged the activation of oxygen at the iron 
center by successive electron transfers from the reductase: 


1 e" 0 2 e - 

Fe (III) Fe (II) -» Fe (II) • 0 2 -*■ Fe (III) • 0 2 “ Fe (II) • 0 2 ~ -*■ Fe (111) ■ 0 2 1 2 ~ 




Annual Reviews 

www. annualrevi e ws . org / aronl ine 


DIOXYGENASE REDOX COMPONENTS 295 


bedCI nHQTETTP I RURKNUKTSE I ETLFOEQflGR I OPR I VTDEOLVQLEtERUFflRSULLLGH 
ndoB rtNVNNK I LUSESGLSQKHL I HGDEELFQHELKT I FRRNULFLTH 

bend rtPR I PU I HTSHLOR I DELLUDNTETGEFKLHRSUFTDQRLFOLEHKV | FEGNUUVLRH 

* * * * * * * 

bedCI ETHIRKPGDYFTTVflGEOPUUUURQKDflSIflUFLHQCRHRGflRlCRSDftGHflKflFTCSV 
ndoB LSL I PRPGDYUTflKflG I DEG I USRQHOGS I RRFLNUCRHRGICTLUSUERGNRKGFUCSY 

bend ESO I PNNNDYYTTY I GRQP I L 1 RRHPNGELNflM I NRCSHRGRQLLGHKRGNKTTYTCPF 

* ** * * * * * *** ** * 

bedCI HGURYDTRGNL I HUPVEAES-FAC — LOKICEUSPLIC — HRUETVKGL I FRNUDEHR I DL 
ndoB HGUGFGSHGELQSUPFEKDL-VGES-LHKKCLG-LKEUflRUESFHGF I YGCFDQEflPPL 
bend HGUTFNNSGiaLKUKDPSDRGVSDCFHQDGSHD-LKICUflRFESYKGFLFGSLHPUDPSL 

*** * * * ****** * 

bedCI DTYLGERKFYnDHILDRTERGTEUIPGIQKUUIPCNUKFflflEQFCSDnYHflGTTRHLSG 
ndoB tlDVLGDRflUVLEPnFKHS-GGLELUGPPGKUU I KANUKAPRENFUGDAYHUG-UTHRSS 

bend QEFLGETTKI IDUIUGQSOQGLEULRGUSTYTYEGNUICLTREN-GRDGVHUS-RUHUNY 

** * * *** ** * ** * 

bedCI I I RGLPEDLELRDLR — PP-KFGKQ YRRSUGGHGSGF Y i GDPNnnLflnnGPICUTS 

ndoB LRSGES IFSSLflGNRRLPPEGflGLQfITSKYGSGUGULUDG-YSGUHSflDLUPELnflNGG 

bend RRTTQHRKEKQRGDT- - 1 RfitlSfiGSUGKHGGGSYGFEHGHflLmTQUGNPE 

* * * 

bedCI YLTEGPRREKflflERLGSI ERGTKinLEHnTUFPTCSFLP GUNT I RTUHPRGPHEGE 

ndoB RKQERLNKE I GDURflR I YRSHLHCTUFPHHSnLTCS GUFKUUNP I DRHTTEUU 

bend DRPNFPKRREYTEKFGfiflnSKUn I ERSRNLCLYPNUYLtlDQFGSQ I -RULRP I SUNKTE 

* 

bedCI UURFTUUDRDflPOD I KEEFRRQTLRTFSRGGUFEQDDGENUUE I QH I LRGHKRRSRPFN 
ndoB TYR I UEKDfl — PEDLICRRLfiDSUQRTFGPflGFUESDDNDNflETfiSQNGKKYQSRDSDLL 

bend UT I VC I RPUGEflPERRflRR I RQVEDFFNRSGnRTPDDlEELPRCQRGYRG I ELEUKOflC 

* * ** 

bedCI REd SnGQTUDNDP I VPGRI SNHUVSEERRRGLVRHllUCfltlTSPDUERLKflTR 

ndoB SNL GFGEDUYGDRUYPGUUGKSR I GETSYRGF YRRVQRHUSSSHUflEFEHRSST 

bend RGSKHU I YGPDDRRNE I GLKPfl I SG I KTEDEGLYLRQHQYULKSnKQR | RREICEFASRQ 

* * 

bedCI 

ndoB UHTELTKTTDR 
bend GENfl 


Figure 4 Comparison of the amino acid homology of the ISP a subunits from the Class IIB 
benzene (bedCI) (59), Class III, naphthalene (ndoB) (67), and Class IB benzoate ( benA ) (79) 
dioxygenases. The asterisks indicate conserved amino acids. 


followed by reaction of the peroxo complex with the substrate. Twilfer et al 
(110) obtained a series of nitrosyl analogs of the oxo-derivatives by using 
nitric oxide. Since NO is an odd-electron species, it can convert Fe(II), which 
is normally undetectable by EPR, to an EPR-detectable species. EPR signals 
were observed for high-spin ( S = 3/2) Fe(II)*NO (or Fe(III)*NO - ) species, 
with g-factors g x — 4, g y = 2. The g-factors of signals changed upon binding 
of several substrates (110). 

coordination of the iron-sulfur clusters The [2Fe-2S] clusters of 
the Rieske proteins of electron -transfer chains, and of the aromatic di- 
oxygenases, appear to have similar chemical structures, as judged by the 
similarities in amino-acid sequences and spectroscopic properties (Figure 3). 
On this assumption, the results of the Rieske-protein studies, such as that 
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from Thermus thermophilus (32), have been extrapolated to those of the 
dioxygenases. 

The sequences for the Rieske iron-sulfur protein from various organisms 
have been determined by amino acid sequencing [beef heart (94)] or deduced 
from their DNA sequence [spinach cytochrome b 6 -f complex (94), Neuro - 
spora crassa (55), Saccharomyces cerevisiae (6), Rhodobacter capsulatus 
(36), and Paracoccus denitrificans (68)]. Analysis of these amino acid se- 
quences reveals that the cysteine residues are conserved in two regions (94). 
From a comparison of these conserved regions ,with those observed in the 
terminal oxygenase and the ferredoxin components of the dioxygenases, 
shown in Figure 3, the following consensus sequence emerged: 

Cys-X-His-15 to 17 amino acids-Cys-X-X-His 

(where X is any amino acid). Because all the above-mentioned proteins 
contain an iron-sulfur cluster that has Rieske-type properties, we propose that 
this consensus sequence is the motif involved in binding to the Rieske-type 
[2Fe-2S] cluster. 

In the conserved regions of sequences of the Rieske proteins, each histidine 
residue is flanked by a single glycine residue either next to it or one residue 
away. Glycine residues occur in the turns of protein secondary structure (51), 
and the [2Fe-2S] cluster may be situated in a cleft of the protein. One of the 
two conserved regions in these Rieske proteins is found in a hydrophobic part 
of the protein and the other in a more amphipathic environment [hydrophobic- 
ity plots (94)]. 

Mossbauer spectra of the dioxygenase protein of benzene dioxygenase from 
P. putida ML2 (37) indicated the presence of [2Fe-2S] clusters similar to 
those observed in the plant-type ferredoxins. The main difference was the 
isomer shift of one of the iron sites, which was greater than in the ferredoxins. 
This effect is similar to that observed in Rieske-type proteins such as that from 
Thermus thermophilus (32). The Mossbauer parameters were compared with 
those of the monooxygenase ferredoxin putidaredoxin, which has a [2Fe-2S] 
cluster with all-sulfur ligation (78). In the reduced state, the Fe(III) site of the 
dioxygenase Fe-S clusters was similar to the corresponding ion in the ferre- 
doxin, but the Fe(II) sites were different, as shown by the electric-field 
gradient and hyperfine tensors. This difference between the two types of 
protein might be explained by different distortions of the cluster geometry, or 
alternatively by their having different ligands. The latter interpretation consid- 
ered it likely that the Fe(II) iron site had nonsulfur ligands, because in the 
Rieske cluster, this iron atom had a larger Mossbauer isomer shift than that in 
putidaredoxin (78). 

Electron-spin relaxation processes are characteristic of the iron-sulfur clust- 
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ers. In EPR spectra, the relaxation rate affects the temperature range over 
which the spectrum can be determined; in Mossbauer spectra it determines the 
highest temperature at which hyperfine structure is observed. It enables the 
reduced [2Fe-2S] clusters to be distinguished, for example, from the more 
rapidly relaxing [4Fe-4S] clusters. The electron-spin relaxation rate of the 
[2Fe-2S] clusters in the dioxygenases and the Rieske proteins is slower than in 
plant-type ferredoxins, but faster than in hydroxylase ferredoxins such as 
putidaredoxin and adrenodoxin. Bertrand et al (10) interpreted such effects in 
terms of differences in geometry of the [2Fe-2S] cluster. 

Cline et al (23), by using electron nuclear double resonance (ENDOR) and 
electron spin-echo envelope modulation (ESEEM) spectroscopy, observed 
hyperfine interactions in the [2Fe-2S] clusters in the respiratory Rieske iron- 
sulfur protein of Thermus thermophilus and in the phthalate dioxygenase of 
Pseudomonas cepacia. All of the spectra showed resonances attributed to 
protons that were weakly coupled to the iron-sulfur clusters. Similar proton 
interactions have been found in all iron-sulfur proteins. However, in addition, 
unusually strong hyperfine resonances were observed, which were assigned to 
the U N of two histidines. 

In a recent series of measurements of the phthalate dioxygenase of P. 
cepacia, Gurbiel et al (48) established the nature of the nitrogen-containing 
ligands to the [2Fe-2S] Rieske-type cluster. Cells of a histidine auxotroph 
were grown on ,5 N-labeled histidine and 14 N background (using natural- 
abundance NH4 as nitrogen source), and on ,4 N histidine in a 15 N back- 
ground. In spectra recorded at Q-band microwave frequency (35 GHz), the 
ENDOR lines due to nitrogen were clearly distinguished from those due to 
protons. The pairs of nitrogen hyperfine lines were assigned as resulting from 
two nitrogens with similar coupling constants. All of the ENDOR lines 
assigned to nitrogen behaved as expected for l5 N (nuclear spin / = 1/2) in the 
I5 N-histidine protein, and for 14 N (nuclear spin / = 1) in the 14 N-histidine 
protein. These results demonstrate the presence of two histidines in the first 
coordination sphere of iron. Furthermore, specific 15 N labeling showed that 
the 6-nitrogens of the imidazole ring bind the iron (47). Based on these 
results, the following structure is proposed as a model of the Rieske iron- 
sulfur cluster: 




n / ,N 


Oxidized 


Reduced 
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Kuila et al (66) determined the resonance Raman spectra of the Rieske 
protein of T. thermophilus and the phthalate dioxygenase of P. cepacia. 
Resonance Raman spectroscopy examines the vibrational modes of the 
chromophoric metal center. For the plant-type ferredoxin [2Fe-2S] cluster, 
the modes are as expected for a centrosymmetric cluster. For the dioxygenase 
cluster, the spectrum showed additional vibrational modes, indicating a major 
perturbation from centrosymmetry. The spectrum was analyzed in terms of 
the above model, with two sulfur and two nitrogen ligands in various relative 
positions. Of the three possible arrangements, the above, with both nitrogens 
on one iron atom, was considered to be the most likely. 

X-ray absorption spectroscopy has been applied to the Rieske protein of 
heart mitochondria (84) and the terminal dioxygenase component of the 
phthalate dioxygenase of P. cepacia (107). In order to observe the Rieske 
cluster in the dioxygenase protein, without interference from the Fe center, 
the latter was eliminated by depletion of iron and replacement with cobalt or 
zinc. Extended X-ray absorption fine structure (EXAFS) spectra of an atom 
such as an iron site depend critically on distances of nearby atoms, particular- 
ly heavy atoms. They depend less on the number of atoms and the bond 
angles. Based on the above model, Tsang et al (107) estimated the following 
distances for the oxidized cluster: Fe-S (bridging), 0.22 nm; Fe-S (terminal), 
0.231 nm; Fe-Fe, 0.268 nm. The Fe-Fe distance expanded slightly on reduc- 
tion. Unfortunately, it was not possible to estimate the number or distances of 
any nitrogen ligands. 

FACTORS INFLUENCING THE MIDPOINT REDOX POTENTIALS OF THE IRON- 

SULFUR clusters Apparently, two, or possibly three, classes of proteins 
contain Rieske-type [2Fe-2S] clusters. The first are the membrane-bound 
proteins involved in respiration and photosynthesis, which contain a single 
[2Fe-2S] cluster and are characterized by having an unusually high redox 
potential that varies with pH. The protein from T. thermophilus that has 
similar properties reportedly contains two [2Fe-2S] clusters per molecule (32) 
and might therefore represent another class of proteins; the sequence is not yet 
determined. The other group includes the NADH-dependent dioxygenase 
components and the 4-methoxybenzoate monooxygenase. By contrast, the 
negative redox potential of the [2Fe-2S] cluster in the dioxygenase com- 
ponents does not appear to be pH dependent (33, 66). Although the coordina- 
tion environment is spectroscopically similar in all three classes of proteins, 
other differences must exist that modulate the properties of the iron-sulfur 
cluster in meeting the different functional needs. 

The redox potentials of the [2Fe-2Sl clusters in the aromatic dioxygenases 
are less negative, by 100-250 mV, than the ferredoxins of photosynthesis or 
P-450 systems, but not as positive as the Rieske clusters of respiratory chains. 
These differences should have a functional significance. In photosynthesis, a 
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strong reductant is required to reduce NADP, while in complexes of the bc x 
type, the Rieske clusters must be reducible by quinones. The dioxygenase 
systems apparently need not have as negative a potential ■•as that in the 
P-450 C am system. This may be an advantage, as a higher^potential ferredoxin 
would be less likely to react with 0 2 to generate superoxide. 

Most iron-sulfur clusters are strong reducing agents. Their midpoint poten- 
tials are generally more negative than those of simple iron centers. This may 
be the consequence of the net negative charge on the clusters, which results 
from the charges on the cysteine ligands and bridging sulfides. However, in 
addition to the Rieske-type iron-sulfur proteins, other types of iron-sulfur 
clusters have less negative midpoint potentials, including the high-potential 
iron-sulfur proteins (HiPIPs), in which the cluster can take Up a higher 
oxidation state with fewer electrons, and the [3Fe-4S] clusters. In addition, 
the midpoint potentials of iron-sulfur clusters of a particular type differ 
considerably depending on their protein environment (19). 

The most important factors determining the midpoint potential are probably 
die electrostatic charge on the cluster and the polarity of the environment (see 
76). Other more subtle factors include hydrogen bonds from the sulfides to 
polypeptide amides (1, 21) and the bond lengths and bond angles of the 
iron-sulfur cluster (81). The ligands to the Fe atoms are expected to play a 
major role (107). In the plant- type fenedoxins, the ligands to the [2Fe-2S] 
cluster are all cysteine sulfurs. The replacement of thiolate by carboxylate 
ligands for [4Fe-4S] clusters resulted in substantial positive shifts of the redox 
potential [100 mV per substituted ligand (60)]. Probably, substitutions of 
thiolate by nitrogen-containing residues will have a similar effect on the redox 
potential. The formal charge on a nitrogen ligand is 0, compared with —1 for 
a Cys - ligand. The above structure would have a formal charge of 0 in the 
oxidized form compared to —2 for the classical [2Fe-2S] 2+ (CysS”) 4 struc- 
ture. A single electron would therefore be more easily stabilized on the Rieske 
cluster (65). The electrostatic effect of charged amino acids around the 
clusters could also influence the midpoint potential. These might include 
histidines that are close to the cluster but not ligands to it. Notably, the 
lower-potential photosynthetic ferredoxins have no histidine residues in the 
vicinity of the cysteine residues. In this context, the surrounding amino acid 
residues of the conserved regions could be targets for substitution studies. 
Using site-directed mutagenesis, one can replace these residues systematically 
and examine the effects on the redox potential and its function in transferring 
electrons. 

CONCLUDING REMARKS 

The aromatic ring-hydroxylating dioxygenases and related oxygenases are 
built up of protein domains and redox centers. Several of these resemble other 
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electron-transfer systems such as those of respiration and photosynthesis. 
However, the aromatic dioxygenases show considerable diversity (Table 1) 
and appear to have multiple origins. The iron-sulfur flavoprotein might have 
evolved by gene fusion between the separate ferredoxin and flavoprotein (18), 
as found in the benzene dioxygenase. However, the iron-sulfur component of 
the flavoproteins is related to the plant-type ferredoxins (52), while the 
ferredoxin components are more closely related to the Rieske proteins (Figure 
3). The diversity of the electron-transfer chains in the dioxygenases so far 
discovered indicates that further permutations of electron carriers will almost 
certainly be found in others. 
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Table 1 MulU<ompc«ent hydroxylating dioxygenases. 





Reductase 

Ferredoxin 

Oxygenase 


Enzyme system 


Prosthetic 

Prosthetic 

Structure (Af r ) 

Prosthetic 

Class 

(reference) 

Mi 

groups) 

M t grOup{») 

gtoup(s) 

1A 

Phthalate dioxygenase (1,2) 

34 000 

FMN[2fe-2S] 

None 

0(48 000)4 

4(2Fe*2SlR,4Fe 

1A 

4-ChkHophenylacetate 

35 000 

FMN [2Fe-2SJ 

None 

o(46000b 

3(2F©-2SJ 

IA 

3.4- <fioxygeaase (3,4) 
4-Sttlpbobenz0ate 

3.4- dioxygenase (5) 

36 000 

FMN [2Fc-2S] 

None 

0(50000)2 
o(5l 000V 

2[2Fc-2S]r. 2Fc 
nd 

1A 

Chlorobenzoaic 

42000 

nd 

None 

IB 

3,4-dioxygenase (6) 
2-Halobeozoate 

37 500 

FAD [2Fe-2SI 

None 

c<52 000b 

3(2Fe-2S], 3Fe 


1 .2-diox vecnase (7) 




PttOOOOb 

3{2Fe-2S], 3Fe 

IB 

Benzoate dioxygenase (8-1 1) 37 500 

FAD (2Fe-2SJ 

None 

o(50 000b 

paoooob 

0(55000)6 

IB 

2-oxo- l«2rdihydroquinoline 

38000 

FAD [2Fc-2S] 

None 

6[2Fc-2S]r. ?Fe 


8-monooxygenase (12) 


- 








Table \ 

continued 







Reductase 

Ferredoxin 

Oxygenase 

Class 

Enzyme system 
(reference) 

Mr 

Prosthetic 

group(s) 

Mr 

Prosthetic 

group(s) 

Structure (M r ) 

Prosthetic 

group(s) 

HA 

Pyrazon (fioxygenase (13) 

67 000 

FAD 

12000 

(2F6-2S) 

180 000 

[2Fe-2S] 

HA 

Dibenzofttran 
4,4a-di oxygenase (14) 

44 000 

FAD 

12000 

[2F6-2S] 

O(45000)z 

PC23000h 

2f2Fe-2SJ. 2Fe 

IIB 

Benzene 1,2-dioxygenase 
05-21) 

87 168“ 
(b2) 

FAD 

11939“ 

[2Fd-2S]r 

0(51 105^2 
PC22 252^2 

2[2Fo-2S]r, 2Fe 

IIB 

Toluene 23-dioxygenase 
(22-26) 

42942* 

FAD 

11900“ 

[2Fo-2S]r 

a(50930“h 

PO2013“)2 

2£2Fb-2S]R.2Fe 

IIB 

Biphenyl dioxygenase 
(27-31) 

42 970“ 

FAD 

11981“ 

[2F6-2S1r 

a<51513“h 

Wriovf-n 

2(2Fo-2S]r.2Fc 

in 

Naphthalene dioxygenase 
(32-36) 

35 552“ 

FAD [2Fe-2S] 

11767“ 

[2Fe-2S) R 

0(49597*^2 

P(23038®)2 

2(2Fo-2S1r, 2Fe 


R indicates a Rieske-type [2Fe-2Sl cl aster, nd/not determined; Aff observed from SDS-PAGE or Q deduced from amino-acid sequence. 
References: 1. Bade er <xf^ 1987; 2, Bade and Ballou, 1990; 3, Schweizerei ol. 1987; 4, Markus er of, 1986; 5, Locber el at, 1991; 6. Nakaisu 
and Wyndham, 1993;7, Fetmex eiai., 1992; 8, Ytm&guchi and Fujisawa, 1978; 9. Vamaguchi and Fujisawa, 1980; 10, Yamaguchi and Fujisawa. 
1982; H'Moodicerot, 1990; 12, Roschc et aL, 1993; 13, Saubere/ot, 1977; 14, Bunz and Cook, 1993; IS, AxeeD and Geary, 197S; 16. Geary 
el aL, 1990; 17, Zamasian and Mason, 1987; 18. Geary et aL, 1984; 19, Tan and Mason, 1990; 20. Tan et aL, 1993: 21. Crutcher and Geary. 
1979; 22, Zylstra and Gibson. 1989:23, Wackctt. 1990; 24. Zylstra et ai f 1988; 25, Subramanian etaL, 1981;26«Siibramanianef of^ 1985; 27, 
Furukawa ciaL 1993; 28, Mcndello. 1989; 29, Erickson and Mondello. 1992; 30, Qhson et aL, 1993; 3 1, Taira et a ! „ 1992; 32, Haigler and 
Gibson. 1990a: 33. Haigler and Gbson. 1990b; 34. Ensleyexn/., 19B2; 35, Ensley and Haigler. 1990; 36. Sueo and Gibson. 1993. 
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Nucleotide Sequences of the Acinetobacter calcoaceticus benABC 
Genes for Benzoate 1,2-Dioxygenase Reveal Evolutionary 
Relationships among Multicomponent Oxygenases 
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MONIQUE REKIK, 2 and SHIGEAKI HARAYAMA 2 
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Biochemistry, Faculty of Medicine, University of Geneva, 121 7 Geneva 4, Switzerland 1 
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The nucleotide sequences of the Acinetobacter calcoaceticus benABC genes encoding a multicomponent 
oxygenase for the conversion of benzoate to a nonaromatic cis-diol were determined. The enzyme, benzoate 

1,2-dioxygenase, is composed of a hydroxylase component, encoded by benAB, and an electron transfer 
component, encoded by benC, Comparison of the deduced amino acid sequences of BenABC with related 
sequences, including those for the multicomponent toluate, toluene, benzene, and naphthalene 1,2-dioxygen- 
ases, indicated that the similarly sized subunits of the hydroxylase components were derived from a common 
ancestor. Conserved cysteine and histidine residues may bind a [2Fe-2S] Rieske-type cluster to the a-subunits 
of all the hydroxylases. Conserved histidines and tyrosines may coordinate a mononuclear Fe(II) ion. The less 
conserved 0-subunits of the hydroxylases may be responsible for determining substrate specificity. Each 
dioxygenase had either one or two electron transfer proteins. The electron transfer component of benzoate 
dioxygenase, encoded by benC, and the corresponding protein of the toluate 1,2-dioxygenase, encoded by xytZ, 
were each found to have an N- terminal region which resembled chloroplast-type ferredoxins and a C-terminal 
region which resembled several oxidoreductases. These BenC and XylZ proteins had regions similar to certain 
monooxygenase components but did not appear to be evolutionarily related to the two-protein electron transfer 
systems of the benzene, toluene, and naphthalene 1,2-dioxygenases. Regions of possible NAD and flavin adenine 
dinucleotide binding were identified. 


The complete degradation of benzoate by aerobic bacteria 
can occur by either of two catabolic pathways. In both 
reaction sequences, benzoate is converted to a nonaromatic 
cis-dio\, 2-hydro-l ,2-dihydroxy benzoate, and then to cate- 
chol (51) (Fig. 1). Catechol is cleaved between the two 
hydroxyl groups when benzoate is metabolized via the ortho 
(or 0-ketoadipate) pathway, and subsequent reactions yield 
the tricarboxylic acid cycle intermediates succinate and 
acetyl coenzyme A (59). Catechol cleavage via the meta 
pathway occurs adjacent to the hydroxyl groups, and sub- 
sequent reactions lead to the formation of pyruvate and 
acetaldehyde. The ortho- and me/a-fission routes do not 
usually operate concurrently, and regulatory factors deter- 
mine how benzoate is degraded when genes encoding both 
pathways are present in the host cell (39). 

The genes responsible for the conversion of benzoate to its 
corresponding c/5-diol have been isolated from two different 
soil bacteria: benABC from the chromosome of Acinetobac- 
ter calcoaceticus (42) and xylXYZ from TOL plasmid pWWO 
of Pseudomonas putida (78). The £enABC-encoded ben- 
zoate 1,2-dioxygenase exhibits narrow substrate specificity, 
showing little or no oxidation of most substituted benzoates. 
In contrast, the plasmid-encoded meta pathway toluate 

1,2-dioxygenase, encoded by xylXYZ, catalyzes the hydrox- 
ylation not only of benzoate but of many 3- and 4-substituted 
benzoates including toluates (78). Benzoate 1,2-dioxygenase 
and toluate 1,2-dioxygenase are each multicomponent en- 
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zymes consisting of three different polypeptides of about 53, 
19, and 38 kDa (19, 21, 42). 

The similar sizes of the polypeptides and the similarity 
among the biochemical reactions catalyzed by the benABC - 
and xylXYZ-e ncoded enzymes raise questions about their 
evolutionary origins. Benzoate and toluate 1,2-dioxygenases 
are structurally related to a family of multicomponent aro- 
matic ring-hydroxylating dioxygenases (9). These dioxygen- 
ases contain an aromatic ring hydroxylase component com- 
posed of two different subunits. The sizes of these a and 0 
subunits, about 50 and 20 kDa, respectively, are similar in 
different dioxygenases. Studies of benzoate 1,2-dioxygenase 
from the soil bacterium Pseudomonas arvilla C-l indicate 
that the structure of this native enzyme is a 3 0 3 (75). Each 
a-subunit contains one iron-sulfur cluster [2Fe-2S], whereas 
the isolated 0-subunit does not contain any detectable iron 
or labile sulfide. One mononuclear Fe(II) atom may also be 
associated with each hydroxylase component (75). 

In addition to the hydroxylase component, the dioxygen- 
ases described above usually contain one or two electron 
transport proteins. The benzoate 1,2-dioxygenase of F. 
arvilla has a single iron-sulfur flavoprotein exhibiting an 
NADH-cytochrome c reductase activity that is responsible 
for the electron transfer from NADH to the aromatic ring 
hydroxylase. This enzyme is a 38- kDa polypeptide with one 
iron-sulfur cluster of the [2Fe-2S] type and one molecule of 
flavin adenine dinucleotide (FAD) (73, 74). The involvement 
of similar proteins in the electron transfer reactions of 
benzoate 1,2-dioxygenase from A . calcoaceticus and toluate 

1,2-dioxygenase from F. putida have been suggested by our 
previous genetic studies (19, 21, 42). In the benzene, tolu- 
ene, and naphthalene dioxygenase systems, however, two 
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FIG. 1. Multicomponent dioxygenases initiate benzoate (1) deg- 
radation by converting it to 2-hydro-l ,2-dihydroxybenzoate (DHB, 
2). Benzoate and toluate 1,2-dioxygenases are each composed of a 
two-subunit hydroxylase, encoded by benAB and xyLXY, respec- 
tively, and an electron transfer oxidoreductase encoded by benC 
and xylZ, respectively. DHB is converted to catechol (3) by an 
NADndependent dehydrogenase, the gene product of benD or xylL. 
The chromosomal ben genes, isolated from A. calcoaceticus , en- 
code enzymes specific for the substrate R=R'=H. The plasmid- 
borne xyl genes, isolated from P. putida TOL plasmid pWWO, 
encode enzymes with broader substrate specificity and can degrade 
compounds with methyl substitutions at positions R and R\ 


subclones were generated from smaller Kpnl-Kpnl, Kpn\- 
EcoRI, and Kpnl-SaB restriction endonuclease fragments of 
the ben DNA. Escherichia coli JM101 supE thi A(lac-proAB) 
(F' traD36 proAB laqF* 2AM 15) (69) was used as a host for 
recombinant plasmids and bacteriophages. 

The Cyclone I Biosystem of International Biotechnolo- 
gies, Inc. (New Haven, Conn.) was used to generate nested 
deletions of the insert DNA of recombinant bacteriophages. 
Using this method, the exposed single-stranded ends of 
cloned DNA fragments were progressively digested by T4 
DNA polymerase. Overlapping clones entirely covering both 
strands of the benABC genes were isolated. 

DNA sequencing. Procedures described in the “M13 Clon- 
ing/Dideoxy Sequencing Instruction Manual" (4a) were fol- 
lowed for the isolation and propagation of M13 phages and 
for the generation of single-stranded DNA sequencing tem- 
plates. DNA sequence was determined by the dideoxy chain 
termination method (56) with commercial kits obtained from 
Bethesda Research Laboratories, Inc. (Gaithersburg, Md.) 
and United States Biochemical Corp. (Cleveland, Ohio). 
a- 35 S-dATP was purchased from Amersham Corp. (Arling- 
ton Heights, 111.). As described previously (41), sequencing 
reaction mixtures were electrophoretically separated on 8% 
polyacrylamide gels with 42% urea in Tris-borate-EDTA 
buffer. Gels were dried under vacuum with a gel slab drier 
(model 224; Bio-Rad Laboratories, Richmond, Calif.) before 
autoradiography with Kodak X-Omat film. 

DNA sequence analysis. Computer-assisted sequence anal- 
ysis was done with software packages PC/GENE (Genofit; 
Intelligenetics; developed by A. Bairoch) and MicroGenie 
(Beckman). Protein similarities were detected by scanning 
version 13 of the SWISS-PROT protein sequence data bank 
(A. Bairoch; distributed by the EMBL Data Library) with an 
algorithm based on that of Lipman and Pearson (37). Com- 
puter-calculated alignments were used to assess the signifi- 
cance of sequence similarities (40), and secondary structure 
predictions were computed by using the conformational 
parameters of Chou and Fasman (7). 

Nucleotide sequence accession number. The sequence of 
the A. calcoaceticus 3,353-bp //mdlll-EcoRI restriction 
endonuclease fragment described in this report has been 
deposited with GenBank under accession number M62649. 


different proteins are required for the electron transfer 
reaction (2, 16, 17, 60, 61). 

To better understand the evolutionary relationships 
among aromatic ring-hydroxy lating dioxygenases, we se- 
quenced benABC and xylXYZ . The sequencing of the latter 
genes is presented elsewhere (20). In this report, we present 
the nucleotide sequence of the A. calcoaceticus benABC 
genes. The deduced amino acid sequences of the gene 
products responsible for the hydroxylation of benzoate were 
compared with those of enzymes which catalyze similar 
reactions. 

MATERIALS AND METHODS 

Bacterial strains, plasmids, and bacteriophages. Recombi- 
nant plasmids derived from pUC19 (69, 76) carrying wild- 
type DNA of A. calcoaceticus BD413 (29) (designated ADP1 
in our collection) from the chromosomal ben region have 
been previously described (42). The 3.4-kbp insert of 
pIB1354 carrying benABC was subcloned into the replicative 
form of bacteriophages M13mpl8 and M13mpl9 (76). Similar 


RESULTS 

Nucleotide sequence of benABC genes. The benABC genes 
were previously localized to a 3.4 kbp Hindlll-EcoKl restric- 
tion endonuclease fragment cloned from the A. calcoace- 
ticus ADP1 chromosome (42). The nucleotide sequence of 
this 3,353rbp DNA fragment was determined and, as shown 
in Fig. 2, it contained three open reading frames. Computer 
analysis by three different methods (12, 32, 57) suggested 
that each open reading frame would be likely to encode a 
protein. The first open reading frame, corresponding to 
benA , started at nucleotide 347 (Fig. 2). An ATG transla- 
tional start codon was preceded by the putative ribosome 
binding sequence GGAG (58). Translation would yield a 
protein product of 52,079 Da, the size of the predicted 
benA-c ncoded hydroxylase subunit (42). 

The second open reading frame, corresponding to benB , 
started at nucleotide 1729. An initiation codon at this posi- 
tion was preceded by the ribosome binding sequence 
GGAG. The deduced molecular mass of the benB product, 
20,073 Da, was in agreement with previous in vitro analysis 
(42). The ATG initiation codon of benB overlapped with the 
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MGCTTTGCT CCTCMCOC MCCAOMA tAGCGGMGAT CTCIMCTTC CU1TMMA TACTCCAXAE G X AX TTT A TT AHCAM1AA TCTCTT7GAA CTTWTAMA CATTCTTTTA AGCTUAAAC 130 
AACCMCMA GACAACAAd AOGCAG CC GC TTOSCXXATT AA MU.il 1L TTCAATTTCC AAAATTGAAA GCTGAAAIGC ATATTCCTTT TATTTCTOCC TTCTGCCCTT AACTAAACAT TTTATGCCTT 260 
OCCTTCTTTA ATTGAATCTT TGAC1AAGCA CAGGCTTTTC CTCTOGCCTA GMCAACTTTC TTATTTTGGA ATGT TGGAGA AAGCAT 

ATC CCA CCT ATT CCC GTC ATT AAT ACT ACC CAT CTT | 

Hat Pro Arg 11« Pro Val II* Am Thr S*r Hi* Lau I 

GAC CCA ATT GAT GAA CTC CTT GTA GAC AAT ACC CAA ACA GGT GAA TTT AAC TTA CAT CCT TCT CIA TTT ACA GAT CAC GCA CTT ITT GAT CTT GAA ATC AAA TAC 417 > 1 

Asp Arg He Asp Glu Leu Leo Val Asp Asa Thr Gla Thr Cly Gla Pb* Lys Leu Bis Arg Sex Vsl Ph* Tbr Asp Gin Ale Leu Ttm Asp Leu Glu Hot Lys Tyr I 

ATT TTC GAA GCA AAT TOC CTT TAT TTC CCT CAT CAA ACC CAC ATT CCC AAC AAC AAC GAC TAT TAC ACC ACC TAT ATT 00C AA CAA CCC ATT TTC ATT GCC CCT 592 I 

lie Pha Glu Gly Asa Txp Val Tyr Leu Ala His Clu S*r Gla II* Pro Asa Asa Asa Asp Tyr Tyr Thr Tbr Tyr II* Cly Arg Gin pro Ha Lw II* Ala Arg » 

AAT CCT AAC GGT GAA CTC AAC GCC ATC ATT AAC GCA TCT TCA CAT CCT GGT GCA CAC CTC CTC CCT CAT AAG CCT OCT AAT AAC ACC ACA TAT ACT TOC CCA TTT 697 | 

Asa Pro Asa Cly Gla L*u Asa Ms Hat Ila Asa Ala Cys S*r Bis Arg Cly Ala Gla Lao Lra Cly Els Lys Arg Gly Am Lys Tbr Tbr Tyr Tbr Cys Pro Fbe j 

CAT GGC TOC ACC TTC AAT AAC TCA GGA AAA TTG TTC AAG CTC AAA CAT CCA AGC GAT CCT CCT TAT TCA GAT TCT TTT AAT CAC GAC OCT TCC CSC GAC TTA AAA 102 | 

Bis Gly Trp Tbr Ph* Asa Asa Sar Gly Lys Leo Leu Lys Vsl Lys Asp Pro Ser Asp Ala Cly Tyr Sar Asp Cys Pha Asn Gin Asp Cly Ser Bis Asp Leo Lys j 

AACGTCGCCCGTTTTGAAAGTTATAAAGCTTTTTTATTTGGC ACT CTCAATCCTCTAGATCCCTCACTCCAAGACTTTTTGGGGGAAACCACCAAAATTATCGK: 907 | 

Lys Val Ala Arg Pha Glu Ser Tyr Lys Gly Pha Lau Ph* Gly Sar Len Asa Pro Val Asp Fro Sar Lau Gla Clu Pha Leu Cly Glu Thr Thr Lys II* Ila Asp I 

ATC An CTC GGC CAA TCC GAT CAC GGC CTT GAA CCA CTC COT GGT CTT TCC ACC TAC ACC TAT CAA GGA AAC TGG AAC TTC ACC CCA GAA AAC GGA GCA GAT GQC 1012 I 

Hat Ila Val Gly Gla Sar Asp Gla Gly Lau Clu Val Lau Arg Gly Val Sar Tbr Tyr Tbr Tyr Clu Gly Asa Trp lys Lau Tbr Ala Glu Am Gly Ala Asp Gly I 

TAT CAT CTT TCC GCC CTC CAC TGG AAC TAT GCA GCC ACC ACC CAG CAT CCT AAA GAA AAA CAG GCA GGT GAT ACC ATT CGC GCC ATG AGC GCC GGC TCC TCC GGC 1117 ben A 
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Lys 

Gin 
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dy 
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Ha 
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1222 1 
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1327 | 

Tbr Glu Lys Pha 

Gly Ala Ala Met 

Sar Lys Trp Hat Ila Glu 
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Tyr 

Fro 
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Tyr 
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1 

| 
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1432 | 
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CAA 

TOG 

am: 

cm: 

ATC 

1537 | 
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1642 | 
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GAC ACC ATC AGC ATC GAA CAC ATT AGC CAC TTT TTC TAT AGC GAG GGC OCT TTT RA GAT GAT GAC CAA TGG GAT GAC TGG CTT GAA TCT TAT GCA CCT CAA GCC TCA 1SS7 

Asp Tbr He Saz 11a Glu Gla 11a Sar da Pha Lau Tyr Sar Glu Ala Arg Pha Lau Asp Asp Glu Gin Trp Asp Asp Trp Leu Glu Cys Tyr Ala Pro Gin Als Ser 

TTT TGG ATG CCC GCC TGG GAC GAT AAC GAC CW CTC ACT GAA AAC CCC CAA ACT GAG ATT TOG CTC ATT TAT TAT CCA AT GGC CAA CCT CTT GAA GAT CCA GTA TTT 196S 

Fbe Trp Mat Pro Ala Trp Asp Asp Am Asp Gin Leu Tbr Glu Am Pro Gin Thr Gla Ila Ser Leu lie tyr Tyr Pro Asp Arg Gin Gly Leu Clu Asp Arg Val Pha 

CCC ATT AAA ACC GAC CCT TCA TCC GCC ACC ATG GCA GAT ACC CCT ACC GCA CAC AAT ATT AGC AAT ATC GAA CTT GAA TCA CCT GAT GCC CTT CAA ATC ACA CIA CCT 2073 
Arg lie Lys Tbr Glu Arg Ser Ser Als Tbr Met Pro Asp Tbr Arg Tbr Ala Bis Am Ila Ser Asn He Glu Vsl du Sar Arg Asp Gly Lau Gin II* Tbr Val Arg 

TTT AAC TGG AAT ACC CTC ACT TTC CCC TAT AAA AAC ACT TAC AGC TAT TTT GGC AXC TCA CGC TAT GTA ATC CAT TTC TCA GGT GAA CAA CCA AAA ATC TTG AGC AAG 21*1 

Ph* Am Trp Am Thr Lau Sar Pha Arg Tyr Lys Am Sar Tyr Ser Tyr Pb* Cly Mt Sar Arg Try val Ila Asp Pb* Ser dy du Gin Pro Lys II* Lau Sar Lys 

m GIG ATG CTT AAG AAT GAC TAT AIT AAT CAA CTC ATT GAT ATT TXT CAT ATT TAA ATCAGTTAXA CCTOCT CA A ACQGACACTT GACIS&QSG TTT ATC ACT TTA TAT TTG 2295 
Tyr Val Met Leu Lys Asn Asp Tyr lie Am On val Ila Asp Ha Tyr Bis He End Met Sar Lau Tyr Lau 

AAT AGC ATT CCT OCC ATC TCA AAC CAT CAA CIA GCA CTT CAA TTT (AA GAT GGC CTT ACC CCT TTT ATC TCC ATC CCT OA GGT GAA ACC TTA TCC GAT GCA GCA TAC 2403 

Am Arg lie Pro Als Met Sar Am Bis Gin Val Ala Leu da Pha du Asp dy Vsl Tbr Arg Phs Ila Cys Ila Ala On dy Glu Tbr Lau Sar Asp Ala Ala Tyr 

CGT CAC CAA ATC AAT AIT OCA ATG GAC TGC CCT GAA GGC GAC TGT GGT ACC TOC CCT GCT CTT TCT GAA TOG GGC AAC BIT (AC ATG CCT GAA GAC AAT TAC ATT GAA 2511 

Arg Gin Gin Ha Am Ila Pro Met Asp cys Arg Glu dy du Cys dy Tbr Cys Arg Ala Pb* Cys Glu Ser Gly Asn Tyr Asp Met Pro Glu Asp Am Tyr Ila Glu 

CAT COl CTC ACC OQ GAA GAA QCG CAG CAG GGC 1MC CTT TTC GCA TCT OlA TCC CCT GCA ACT TCA GAT OCT GTA TTT CAA MT CAG CCC TCT TCT GAC CTA TCT AAA 2619 

Asp Ala Lau Thr Pro Glu du Ala Gin Gin dy Tyr Vsl Leu Ala Cys dn cys Arg Pro Thr Sar Asp Ala Val Pb* Gin II* Gin Ala Sar Sar Glu Val Cys Lys 

AGC AAG ATT CAT CAC TTT GAA GGC ACC TTG GG6 CGC GTT (M AAT CTA TCC GAT TCG ACC ATC ACC ITT GKT ATT CAC CTC CAT GAC GOT CAC CCC GAT ATT CAT TTT 2727 

Thr Lys Ila Bis His Pha Clu Gly Thr Leu Ala Arg Vsl dn Am Leu Sar Asp Ser Tbr Ue Thr Fbe Asp H* Gin Leu Asp Asp Cly Gin Pro Asp II* His Ph# 

CTC GCA GGG CAC TAT CTC AAC CTC ACC CTC CGC GDC AGC ACC GAA ACA CGC TCC TAT TCC TTT AGC TCA CAA COt GGC AAT CGC TTA ACC GGC TTC CTG CTT CGT AAT 2S3S 
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FIG. 2. DNA sequence of the 3,353-bp chromosomal A. calcoaceticus //mdlll-EcoRI restriction endonuclease fragment carrying the 
benABC genes. The open reading frame corresponding to each gene is marked by broken lines, and within each open reading frame, 
nucleotides are arranged as codons above the deduced amino acid sequence. Numbers indicate the nucleotide positions of adjacent bases. 
Possible ribosome binding sites are underlined at positions 336, 1718, and 2273. A second possible initiation codon and ribosome binding site 
for the translation of benC are found at positions 2299 and 2311, respectively. 


TGA stop codon of benA. This arrangement, which may be 
involved in translational coupling, has been observed previ- 
ously with catabolic genes, in particular those, like benA and 
benB , encoding nonidentical enzymatic subunits (44). 

Two possible translational initiation sites were found for 


the third open reading frame, benC . At positions 2281 and 
2311, potential initiation codons were preceded by the 
ribosome binding sequences GAGG and AGGA, respec- 
tively. Sequence comparisons described below suggested 
that benC starts at position 2281. Translational initiation at 
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MPRIPVINTSHI^RIDEU,VDNTETGEFKLHRSVFTDQALFDLEM1CYIFEGNWVYLA H 
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* ** * * * * 
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** ** 
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★ 


BenA 

XylX 

BnzA(Pl) 

TodCl 

NdoB 


WTFNNSGKLLKVKDPSDAGYSDCFNQDGSKDLKKVARFESYKGFLFGSLNPVDPSLQEFL 
WTFSNSGKLUWKDPKGAGYPDSFDCDGSHDLKKVARFASYRGF LEGS LREDVAPLEEFL 
WAYDTAGNLVNVP YEAES-FA-CLNKKEWSP IX— ARVETYKGLIFANWDENAVDLDTYL 
WAYDTAGNLVNVP YEAES-FA-CLNKKEWSP IX — AKVETYKGLIFANWDENAVDLDTYL 
WGFGSNGELQSVPFEKDL-YGESLNKKCLG-IXEVAKVESFHGFIYGCFDQEAPPLMDYL 
* * * * ***** ** 
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Y 

H 

VG WT 
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* 

* 
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WNYAATTQ 

WNYAATQQ 

LSGILAGL 

LSGILAGL 

ASSLRSGE 


BenA 

XylX 

BnzA(Pl) 

TodCl 

NdoB 


HRKEKQAGDT IRAMSAGSWGKHGGGS 

QRKLRDAGDD IRAMTASSWGGOGGGF 

PEDLEMADLA — PP-TVGKQ YRASWGGHGSGF 

PEDLEMADLA — PP-TVGKQ YRASWGGHGSGF 

SIFSSIAGNAALPPEGAGLQHTSKYGSGMGVLWDG- 


Y 

Y 

Y 

Y 

|y! 


GFEHGHMLLWTQWGNPEDRPNFP 
SFENGHQMVWARWGDPKNRPLFA 
VGDPN1MLAI HGPKVTSYWTEGP 
VGDPNLMLAI MGPKVTSYWTEGP 
SGVHSAOLVPELHAFGGAKQERL 


* * I* 


BenA 

XylX 

BnzA(Pl) 

TodCl 

NdoB 


KAAEYTEKFGAAMSKWMIERSRNIJCLYPNVYLHDQFGSQI-RVLRPISVNKTEVTIYCIA 

ERDRLASEFGEARADWMIGVSRNLCLYPNLYUffiQFGSQL-RITRPLSVDRTEITIYCIA 

ASEXAAERLGSVERGSKLMVEHMTVFPTCSFLP GINTVRTLASARAERGEVWAFTW 

ASEKAAERLGSVERGSKLMVEHMTVFPTCSFLP GINTVRTWHPRGPNEVEVWAFTW 

NKEIGDVRARIYRSHLNCTVFPNNSMLTCS GVFKVWNP I DANTTEVWTYAIVEK 


BenA 

XylX 

BnxA(Pl) 

TodCl 

NdoB 


PVGEAPEARARRIRQYEDFFNASGMATPDDLEELPRCOAGYAGIELEWNDMCRGSKHWIY 
PKGETPR-RARRVRQYEDFFNVSGMATPDDLEEFRACQEGFAGGGM — NDMSRGAKHWIE 

DADAP DD IKEEFRAR-LRTFSPVACSSRTTGRT-GSRSSTSCEATSRSRPFNAEM 

DADAPDD IKEEFRRQT1RTFSAGGVFEQDDGENWVEIQHI LRGHKARSRPFNAEM 

DM— PEDLKRRLADSVQRTFGPAGFWESDDNDNMETASQNGKKYQSRDSDLLSNL 


BenA 
XylX 
BnzA (PI) 
TodCl 
NdoB 


GPDDAANBI GLKP AISGI KTEDEGLYLAQHQYW1KSMKQAI AAEKEF ASRQGENA 

GPDEGAKEIDLHPKLSGVRSEDEGLFVMQHKYWQQCMIKAVKREQDRLIHAEGV 

SMDQTVDNDP VYPGRI SNNVYSEEAARGLYAHWLRMMTSPDWDALKATR 

SMDQTVDN DP VYPGRI SNNVYSEEAARGLYAHWLRMMTSPDWDALKATR 

GFGEDVY GDAVYPGWGKSAIGETSYRGFYRAYQAHVSSSNHAEFEHASSTWHTELTKTTDR 
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FIG. 3. The a-subunits of the hydroxylase components of five multicomponent aromatic ring dioxygenases appear to be homologous. 
Ben A, XylX, BnzA (PI) (27), TodCl (79), and NdoB (34) are approximately 50-kDa components of the benzoate, toluate, benzene, toluene, 
and naphthalene 1,2-dioxygenases, respectively. Positions at which amino acid residues of all proteins are identical are indicated by asterisks. 
Conserved histidines and tyrosines which may coordinate mononuclear Fe(II) atoms are enclosed by boxes. Two conserved cysteines are also 
enclosed by boxes, and these residues together with the closest conserved histidines may bind a Rieske-type [2Fe-2S] center. Gaps introduced 
into the amino acid sequences for the purpose of alignment are indicated by dashes. 


this position would yield a 38,787-Da protein, in good 
agreement with the predicted size of 38 kDa (42). 

Codon usage and G+C content of benABC . The G+C 
content of the benA y -B y and -C genes, 45, 41, and 47%, 
respectively, reflected the characteristic 38 to 47% value 
found in Acinetobacter species (28). Similar codon usage 
was observed among the three genes and was much like that 
found in previously sequenced A. calcoaceticus structural 
genes (6, 8, 13, 14, 41, 45, 54). The most marked preference 
for particular codons was observed for arginine, for which 
two of six possible codons predominated. The CGT and 
CGC codons were used 50 to 63% and 13 to 30% of the time, 
respectively. The bias toward using preferred codons was 
found in E. coli to be a measure of gene expression levels (1). 

Although relatively few genes have been sequenced from 
Acinetobacter species, a comparison of codon usage in 
bacteria of this genus with that found in E. coli (1) suggested 
that the general patterns of codon usage are similar. Differ- 
ences included a bias toward using the GCA codon for 
alanine and the CAA codon for glutamine by Acinetobacter 
species. The E . coli preference for the CTG (leucine) codon 
was not observed in benABC . A comparison of codons used 
for proline showed that the E. coli bias toward CCG was 
replaced by one in Acinetobacter species for CCA. The TCC 
codon for serine was used in fewer than 10% of the serine 
residues by Acinetobacter species but not by E. coli . 

Choice of related sequences for comparison with the de- 
duced amino acid sequences of BenABC. The deduced amino 
acid sequences of the protein components of the A. calco- 
aceticus benzoate 1,2-dioxygenase system were compared 
with those of other aromatic ring-hydroxylating dioxygena- 
ses for which sequence data are available. These include the 


benzene, naphthalene, toluene, and toluate dioxygenases. 
Irie et al. (27) determined the nucleotide sequence of a 
chromosomal DNA fragment of P. putida carrying four 
genes, bnzABCD , encoding the benzene 1,2-dioxygenase 
system. The bnzABCD gene products have been designated 
PI to P4,' and we use these designations parenthetically. 
Sequence data for three components of naphthalene 1,2- 
dioxygenase, encoded by ndoABC , were derived from DNA 
of P. putida NCIB 9816 carrying a naphthalene-degrading 
plasmid (34). The nucleotide sequences of the todClC2BA 
genes for toluene 1,2-dioxygenase have been determined 
from chromosomal DNA of P . putida FI (79). In addition, 
we have determined the nucleotide sequence of the TOL 
plasmid pWWO xylXYZ genes encoding toluate 1,2-dioxy- 
genase (20). 

In the hydroxylase components, the greatest degree of 
similarity was found in the N-terminal region of the a-sub- 
units. The amino acid sequences of BenA, XylX, BnzA (PI) 
(27), TodCl (79), and NdoB (34) were all found to be similar 
(Fig. 3). Pairwise comparisons showed that 62% of the 
aligned BenA and XylX residues were identical, whereas 
comparison of the BenA sequence with each of the others in 
Fig. 3 revealed 20 to 24% identity. These values reflected the 
averaging of approximately 30% identity in the N-terminal 
regions of the protein sequences and 10% identity in the 
C-terminal regions. Each of these proteins is an approxi- 
mately 50-kDa subunit of a dioxygenase hydroxylase com- 
ponent. Two cysteine residues and two histidine residues, 
conserved among all the hydroxylase subunits, were found 
to align with conserved residues in Rieske iron-sulfur pro- 
teins (Fig. 4). As discussed below, these residues may bind 
a Rieske-type [2Fe-2S] cluster (53). 
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FIG. 4. Two conserved cysteines and histidines may coordinate [2Fe-2S] clusters. Similar alignments of these residues/enclosed in boxes, 
suggested that the aromatic dioxygenase subunits, BenA, XylX, BnzA (PI) (27), TodCl (79), and NdoB (34), are Rieske-type iron-sulfur 
proteins. The small iron-sulfur proteins, TodBl, NdoA, and BnzC (P3), of the toluene, naphthalene, and benzene dioxygenases, respectively, 
also have cysteines and histidines similarly aligned with those of the Rieske iron-sulfur proteins of Rattus norvegicus (R.n.) (43), Rhodobacter 
capsulatus (R.c.) (10), S. cerevisiae (S.c.) (3), Paracoccus denitrificans (P.d.) (35), and Bradyrhizobium japonicum (Bj.) (66). Residues 
identical in all aligned positions are indicated by asterisks. 
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BenB MNATALLDTISIEQISQFLYSEARFLDDEQWDDWLE-CYAPQASFWMPAWDDN DQLTENPQTEISLIYYPDRQGLED 

XylY MTISYEAVRDFLYREARYLDDKQWESWLE-MYAPDATFWMPAWDDR DQLTEDPQSQISLIWYGNRSGLED 

NdoC MMINIQEDKLVSAHDAEEILRFFNCHDSALQQEATTLLTQEAHLLDIQAYRAWLEHCVGSEVQYQVISRELRAASERR — YKLNEAMNVYNENFQQLKV 
BnzB(P2) MIDSANRADVFLRKPAPVAPELQHEVEQFYYWEAKLLNDRRFEEWFAL-LAEDIHYFMPIRTTRIMRDSRLEYSGSREYAHFDDDATMMKG 

TodC2 MIDSANRADVFLRKPAPVAPELQHEVEQFYYWEAKLLNDRRFEEWFAL-LAEDIH YFMP IRTTRIMRDSRLEYSGSREYAHFDDDATMMKG 

** * * t 

BenB RVFRIKTERSSATMPDTRTAHNISNIEVESRDGLQITVRFNWNTLSFRYK NSYSYFGMSRYVI — DFSGEQPKILSKYVMLKNDYINQVIDIYHI 

XylY RVFRIKTERSSATIPDTRTSHNISNLELLEQSDGVCKLRYNWHTMNYRYK TVDHFFGTNFCTL — DTCGETPLITAKKWLKNDYIRQVIDVYHV 

NdoC RVEHQLDPQNWGNSPKLRFTRFITNVQAAMDVNDKELLHIRSNVILHRARRGNQVDVFYAAREDKWKRGEGGVR-KLVQRFVDYPERILQTHNLMVFL 
BnzB(P2) RLRKITSDVSWSENPASRTRHLVSNVMI-VGAEAEGEYEISSAFIVYRNRLERQLDIFAGERRDTLRRNTSEAGFEIVNRTILIDQSTILANNLSFFF 
TodC2 RLRKITSDVSWSENPASRTRHLVSNVMI-VGAEAEGEYEISSAFIVYRNRLERQLDIFAGERRDTLRRNTSEAGFEIVNRTILIDQSTILANNLSFFF 
* * * * * 

FIG. 5. Alignment of the p-subunits of the aromatic hydroxylase components. BenB, XylY, NdoC (34), BnzB (P2) (27), and TodC2 (79) 
are the approximately 20-kDa proteins of the benzoate, toluate, naphthalene, benzene, and toluene dioxygenases, respectively. Nine residues 
marked by asterisks are identical in all sequences. Dashes indicate sequence gaps. 


Amino acid sequence similarities were also discovered 
among BenB, XylY, NdoC (34), BnzB (P2) (27), and TodC2 
(79) (Fig. 5). In the aligned sequences of BenB and XylY, 
56% of the amino acids were identical. In pairwise com- 
parisons of BenB with each of the other sequences shown 
in Fig. 5, 16 to 18% of the amino acids were identical. 
Although these values are relatively low, the sequence 
similarities were shown to be significant by the method of 
Needleman and Wunsch (40), with similarity scores above 
4.2. Each of these proteins is an approximately 20-kDa 
P-subunit of a hydroxylase, and nine residues, marked by 
asterisks in Fig. 5, were found to be invariant among the 
protein sequences. 

The BenC and XylZ electron transfer components were 
compared with both ferredoxins and oxidoreductases. BenC 
was similar to XylZ (49% identical residues) but not to any 
components of the aromatic dioxygenases with two-protein 
electron transfer systems. Homology was detected, how- 
ever, between BenC and XylA, a protein component of 
xylene monooxygenase (62) (Fig. 6). XylA may be involved 
in the transfer of electrons from NADH to XylM, the 
hydroxylase component of this two-subunit monooxygen- 
ase. The BenC and XylA sequence similarities extended 
throughout the entire length of the proteins; 28% of the 
aligned residues were identical. 

As shown in Fig. 6 and 7, the N-terminal region of BenC 
was found to resemble chloroplast-type ferredoxins. Resi- 
dues important for ferredoxin structure and function (4, 49) 
were conserved in BenC, including the four cysteine resi- 
dues which coordinate [2Fe-2S] clusters. In addition, this 
ferredoxinlike region of BenC was similar to the C-terminal 
region of VanB, a 33-kDa protein involved in vanillate 
demethylation (5) (Fig. 6). In this region, 24% of the aligned 
VanB and BenC amino acid residues were identical, includ- 
ing those, like the four cysteines, which are conserved 
among chloroplast-type ferredoxins. 

The C-terminal region of BenC was found to be homolo- 
gous to the C-terminal regions of ferredoxin-NADP + re- 
ductases from spinach (30) and from Spirulina species (77). 
In the alignment shown in Fig. 6, 20% of the BenC C-termi- 
nal residues were identical to those of a ferredoxin reduc- 
tase. A stretch of roughly 130 amino acids between positions 
190 and 320 of BenC was also found to resemble a similarly 
sized region of two other oxidoreductases: NADPH-cy- 
tochrome P-450 reductase isolated from Saccharomyces 
cerevisiae (72), pigs (18), rats (50), and rabbits (31); and 
NADPH- sulfite reductase isolated from E. coli and Salmo- 


nella typhimurium (46, 48). In this region, the identity 
between BenC amino acid residues and those of each aligned 
oxidoreductase was 21 to 37%. The regions of strongest 
homology, shown in Fig. 8, may be involved in NAD(P) and 
FAD binding. 


DISCUSSION 

Nonaromatic c/s-diols are common intermediates in the 
degradation of a wide variety of aromatic compounds. Such 
diols are usually formed by reactions catalyzed by multicom- 
ponent dioxygenases. All the hydroxylating dioxygenases 
studied to date have been composed of a hydroxylase 
component, usually composed of nonidentical a- and {3-sub- 
units, and a one- or two-protein electron transfer system (9). 
In this study, we compared the sequences encoding A. 
calcoaceticus benzoate 1,2-dioxygenase with those of the P. 
putida pWWO TOL plasmid encoding toluate 1,2-dioxygen- 
ase. In addition, comparisons were made with sequences for 
functionally related enzymes. 

The hydroxylase components of various multicomponent 
dioxygenases were found to be similar. The sizes of the a and 
(3 subunits in the different dioxygenases discussed are 
roughly 50 and 20 kDa, respectively. A comparison of the 
amino acid sequences of the benzoate, toluate, benzene, 
toluene, and naphthalene 1,2-dioxygenases suggested that 
the a-subunits and the |3- subunits are each derived from 
common ancestors. The conservation of amino acid se- 
quences was strongest in the N-terminal region of the 
a-polypeptides. This region may be responsible for com- 
mon hydroxylase functions such as recognition and activa- 
tion of two oxidizable carbons of the aromatic ring, recep- 
tion of electrons from the electron transfer components, 
binding to the p-subunit, and binding and activation of 
oxygen. 

The binding of oxygen, which usually involves metal 
cofactors, may be mediated by mononuclear Fe(II). One 
such iron was shown to be associated with each a-subunit of 
the benzoate 1,2-dioxygenase from P. arvilla (75). The iron 
atom is likely to be coordinated by tyrosine and histidine 
residues of the polypeptide. Five invariant histidines and 
two invariant tyrosines were detected in the alignment of the 
five a-subunits (Fig. 3). Some of these residues may be 
involved in iron binding. 

The arrangement of two of the invariant histidines with 
two conserved cysteine residues (Fig. 4), however, sug- 
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VanB 214 aa~YFAAPVQPAGDARAFEGRLARSGLTLQVP — AERSVAQVLDDAGVC I P LACEQGI CGTCL 


F r dox ATYKVTLI NEEEGINAI LEVADDQTI LDAGEEAGLDLP S SCRAGSCSTCA 

Xy 1A KKI SGLFVPPPESTVSVRGQGFQFKVPRGQTILESALHQGIAFPHDCKVGSCGTCK 

BenC MSLYLNRIPAMSNHQVALQFEDGVTRFICIAQGETLSDAAYRQQINIPMDCREGECGTCR 

XylZ mthkvatdfedgvtrfidantgetvadaayrqginlpldcrdgacgack 

* * * 

VanB TRVLDGEPEH RDSFLTDAERARNDQFTPCCSRARSACLVLDL 

Frdox GKLVSGAAPN QDDQAFLDDDQLAA-GWVMTCVAYPTGDCTIMTHQESEVL 

XylA YKLISGRVNE — LTSSAMGLSGDLYQS-GYRLGCQCIPKEDLEIELDTVLGQALVPIETSAL — 

BenC afcesgnydmpednyiedaltpeeaqq-gyvlacqcrptsdavfqiqassevcktkihhfeg — 

XylZ cfaesgrysl-geeyiedalseaeaeq-gyvltcqmraesdcvirvpaasdvcktqqagyqa — 

FNR1 aktdipvniyi — 

FNR2 QIASDVEAPPPAPAKVEKHSKMEEGITVNKFK — 


XylA ISQKRLHFEAHDIVEMEWPD KQIAFYPGQYADVECAECSA 

BenC TLARVENLSDSTITFDIQLDDGQ PDIHFLAGQYVNVTLPGTTE 

XylZ AISNVRQLSESTIALSIKSASL NQLAFLPGQYVNLQVPGSDQ 

FNR1 ykpknpyigkclsnee lvreggtgtvrhlifdisggdlry legosigiippgtdnngkph 

FNR2 FKP KTPYYGRCLLNTKITGDDAPGETWHMVFS -HEGEIPYREGOSVGVIPDGEDKNGKPH 

FAD-ppi binding region of FNR 


* * * 

PPQPDGSLSFHVRLVP GGVFSFWLFGGDRTFATLTLR 

QPGNRLTG-FWRNVP QGKMSEYLSVQAKAGDKMSFT 

LQKDGEVS-FL IRKLP GGLMS SFLTS LAKVGD S V S LA 

TRHGnHVDnTWSLCVROLEY KtfPErGErVYGVCSrYLCNL -EAGADVAXl* 
RALGDFGDAKSVSLCVKRLIYTN- DAGErJfrgVCEWELCPL-fCPGAEVKLr 
FAD-isoalloxazine ring-binding NADP-ppi binding region of FNR 


XylA 

-V 

RSYSFSA 

BenC 

-T 

RSYSFSS 

XylZ 

-T 

RAYSFSS 

FNR1 

KL 

RLYSIAS 

FNR2 

KL 

RLYSIAS 


XylA 

BenC 

XylZ 

FNR1 

FNR2 


RD — VLLFFGA-RQQ 

H p VRLVFG-VTQ 

HP LHLIYG-VTH 

DYKFKGLAWLFFGIPYSP 
DYKFNGLAWLFLGVPTSS 

NAD (P) -ribose binding region 


APYGQFGL — HE 
GPFGSFYL — RD 
GPLGAFYL — RE 


GP VGKEMIM P KD 


SNATMVCVAGGTGLAP I KCVLQSMTQAQRE 
| VKRPVLMLAGGTGIAPFLSMLQVLEQKGSE 
I KRP LLLLAGGTGLAPFTAMLEKI AEQGGE 
EDATI IMMATGTGI APFRAFLWRIFKEQHE 
PNATI IMLGTGTGI APFRSFLWKMFFEKHD 


XylA RDLYCLDEIEALQLDWGGRFELI-PVLSEESSTSSWKGKRGMVTEYFKEY — EYLTGQPY 


BenC DCDLVALEQLDALQQKLPWFEYR-TWAHAESQHERKGYVTGHIEY DWLNGGEV 

XylZ DHDLVEMDKLEAFAARIPNFSYS-ACVASPDSAYPQKGYVTQYIEP KQLNGGEV 


FNR1 NILYQQE-LEELQEEFPENFRLTLAISREQQNPEGGKMYIQDRIKENADQLWELIQKPNT 

FNR2 SLLYKEE-FEKMKEKAPDNFRLDFAVSREQTNEKGEKMYIQTRMAQYAVELWEMLKKDNT 


* * * * 

XylA EGYLCGPPPMVDAAETELVRLGVARELVFADRFYNRPPC 

BenC DVYLCGPVPMVEAVRSWLDTQGIQPANFLFEKFSAN 

Xy 1 Z DI YLCGPPPMVEAVSQYIRAQGIQPANFYYEKFAASA 

FNR1 HTYICGLKGMEGGIDEGMSAAAGKFDVDWSDYQKELKKKHRWHVETY 

FNR2 YVYMCGLKGMEKGIDDIMYSLAAAEGIDWIEYKRQLKKAEQWNVEVY 

FIG. 6. Alignment of BenC and XylZ oxidoreductases with related sequences. The N-terminal portions of the proteins resemble 
chloroplast-type ferredoxins such as Aphanothece sacrum II (Frdox) (25) and a region at the C terminus of a Pseudomonas vanillate 
demethylase component (VanB) (5). The C-terminal regions of BenC and XylZ resemble ferredoxin-NADP + oxidoreductases from Spirulina 
species (FNR1) (77) and spinach (FNR2) (30). XylA, a component of xylene monooxygenase (62), is similar throughout the length of the 
proteins. Boxes indicate possible FAD-isoalloxazine ring- and N A D(P)- ribose- binding regions. Portions of FNR1 and -2 which may bind the 
FAD-PPj moiety are underlined. Portions of FNR1 and -2 which may bind the NADP-PP; moiety are underlined and italicized. Asterisks 
indicate identical residues in the aligned sequences, and dashes indicate gaps. 


gested that these particular residues are involved in the 
coordination of a Rieske-type (53) iron-sulfur cluster. The 
biochemical study of benzoate 1,2-dioxygenase demon- 
strated that one [2Fe-2S] iron-sulfur center is associated with 
each a-subunit (75). Iron-sulfur center binding is mediated 


by four cysteines in proteins such as ferredoxins, whereas 
coordination is mediated by two cysteines and two histidines 
in Rieske iron-sulfur proteins (11, 15). These small (roughly 
20-kDa) electron transport proteins are characterized by a 
relatively high redox potential (67). 
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FIG. 7. The N-terminal regions of the BenC and XylZ electron transfer proteins resemble chloroplast-type ferredoxins (ferl to -10). 
Conserved cysteines which bind a [2Fe-2S] center are enclosed in boxes. Additional conserved residues are indicated by asterisks, and gaps 
are marked by dashes. Ferredoxins 1 to 10 are from the following organisms: 1, BumiUeriopsis filiformis (26); 2, Anabaena variabiUs (68); 3, 
Anacystis nidulans (52); 4, Porphyra umbiticatis (64); 5, Spirulina platensis (23); 6, Spirulina maxima (65); 7, Chlorogloeopsis fritschia (63); 
8, Aphanizomenon flos-aquae (36); 9, Synechocystis sp. (22); 10, Halobacterium halobium (24). 


The amino acid alignment shown in Fig. 4 substantiates 
the idea that the hydroxylase a-subunits of the benzoate, 
toluate, benzene, toluene, and naphthalene dioxygenases are 
all Rieske-type iron-sulfur proteins. This suggestion has 
been made previously on the basis of the physical properties 
of some of the dioxygenases (11). Spectroscopic studies of 
the functionally similar phthalate dioxygenase from Pseudo- 
monas cepacia have clearly shown that in this enzyme, two 
histidines coordinate a [2Fe-2S] Rieske-type cluster (15). 
The specific residues involved in iron-sulfur binding by 
Rieske proteins have not previously been identified, but the 
sequence alignment presented in Fig. 4 suggests that they 
would be the conserved amino acids which are enclosed in 
boxes in the figure. 

The low degree of homology detected among the five 
3-subunits of the hydroxylase components suggested that 
these subunits are not directly involved in the common 
catalytic functions of the dioxygenases. The product of the 
xylY gene, the 3-subunit of toluate 1,2-dioxygenase, appears 
to be important for the determination of the substrate 
specificity of this enzyme (21). Of the nine conserved resi- 
dues among the 3-subunits, five are charged amino acids. It 
is possible that, as for the association between cytochromes 
c and b 5 (70), the charged amino acids play a role in the 
association between the a- and 3-subunits of the hydroxy- 
lases. 

The sizes and numbers of the electron transfer components 


of the multicomponent aromatic oxygenases vary. A compar- 
ison of the available sequences of electron transfer compo- 
nents suggested that variation reflects different evolutionary 
ancestry of the electron transfer proteins. In the naphtha- 
lene, toluene, and benzene 1,2-dioxygenases, two redox 
proteins are involved in the transfer of electrons from 
NADH to the hydroxylase component. One is a flavoprotein 
that has NADH oxidase activity, and the other is a ferre- 
doxin, which has one Rieske-type [2Fe-2S] cluster. In the 
benzoate and toluate 1,2-dioxygenases, a single protein that 
contains both flavin and a [2Fe-2S] center performs the 
electron transfer. BenC and XylZ were each found to have a 
ferredoxinlike N-terminal region and an oxidoreductaselike 
C-terminal region (Fig. 6). Neither of these regions appeared 
to be homologous to their functional counterparts in the 
toluene and benzene dioxygenase systems. 

The benzoate and toluate dioxygenases use a chloroplast- 
type ferredoxin for transferring electrons to the terminal 
hydroxylase. As shown in Fig. 7, the N-terminal regions of 
BenC and XylZ are homologous to chloroplast-type ferre- 
doxins. Many residues in the 12 aligned sequences are 
identical, including the four cysteines which coordinate a 
[2Fe-2S] center in the ferredoxins (49). Similar examples of 
a ferredoxinlike region occurring within a larger protein were 
found in two monooxygenase components, VanB (5) and 
XylA (62) (Fig. 6). In VanB, the ferredoxinlike region occurs 
in the C-terminal portion of the protein. Most examples to 


Possible NAD-ribose binding 
region of BenC and XylZ 


Fossinie FAD-isoaiioxazme ring 
binding region of BenC and XylZ 


BenC 

220 

KRP VLML AGGT G I AP FL S MLQ VL 






***.*.******.*** ** . 




XylZ 

207 

KRPLLLLAGGTGLAPFTAMLEKI 

BenC 

167 

RSYSFSS 



**;*.; ** ***** ; * 

XylZ 

153 

RAYSFSS 

SR(E.c.) 

452 

ETPVIMIGPGTGIAPFRAFMQQR 

SR(S.t.) 

386 

RLYSIAS 



**.*.. ******* * .* 

SR (E . c . ) 

386 

RLYSIAS 

SR (S . t . ) 

452 

ETPVIMIGPGTGIAPFRSFMQQR 

P450R (rat) 

454 

RYYSIAS 



**.*.. ***.***.. . * 

P450R (pig) 

453 

RYYSIAS 

P450R(pig> 

524 

TTP VIMVGPGTGVAPF I GF IQER 

b5R 

66 

RPYTPIS 



**;*.. *******.. . * 




P450R(rat) 

525 

TTPVIMVGPGTGIAPFMGFIQER 






*• *** **.* * * *** 




VanB 

104 

SSRSLLFAGGIGITPILAMAQVL 






. * *.****** * * . .. 




b5R 

146 

VKSVGMIAGGTGITPMLQVIRAI 





FIG. 8. Possible NAD- and FAD-binding regions of aromatic dioxygenase flavoproteins BenC, XylZ, BnzD (P4) (27), and TodA (79). 
Possible cofactor-binding regions of BenC and XylZ are aligned with those of sulfite reductase of E. colt , [SR(E.c.)] and 5. typhimurium 
[SR(S.t.)] (46), cytochrome P-450 oxidoreductase of pig, [P450R(pig)l (18) and rat [P450R(rat)] (50), vanillate demethylase (VanB) (5), and 
cytochrome b 5 reductase (b5R) (33). An asterisk above a residue indicates identity with the BenC amino acid; a colon indicates similarity. 
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date of ferredoxinlike segments in larger proteins involve 
“bacterial-type” [4Fe-4S] centers in complex membrane 
proteins (4). 

The benzene, naphthalene, and toluene dioxygenases have 
separate electron transport proteins that resemble Rieske 
iron-sulfur proteins. Each of these dioxygenases has a small 
12- to 15-kDa electron transfer component containing a 
[2Fe-2S] center (27, 34, 38, 79). Previous observations of 
redox potential and spectroscopic properties suggested that 
these small proteins were unlike previously characterized 
ferredoxins (38). Some similarities to Rieske iron-sulfur 
proteins have been noted (17, 61). As shown in Fig. 4, similar 
alignments of two cysteines and two histidines in the Rieske- 
type hydroxylase components, the Rieske iron-sulfur pro- 
teins, and the toluene, naphthalene, and benzene “ferre- 
doxin” components suggest that these electron transport 
proteins of the dioxygenases are more similar to Rieske 
iron-sulfur proteins than to other ferredoxins. 

The protein components of benzoate and toluate dioxygen- 
ases involved in NADH oxidation are dissimilar to those of the 
benzene and toluene dioxygenases. The similarities observed 
between the BenC and XylZ C-terminal regions and a 
number of oxidoreductases (Fig. 6 and 8) suggested that 
these portions of the proteins are responsible for direct 
interaction with NADH. One of the oxidoreductases resem- 
bling BenC and XylZ is the NADPH-sulfite reductase in- 
volved in cysteine biosynthesis (46, 48). It is interesting that 
homology has previously been found between two regula- 
tory proteins, one controlling transcription of the genes for 
sulfite reductase in E. coli and S. typhimurium (47) and the 
other controlling transcription of genes needed for benzoate 
metabolism in A. calcoaceticus (41). 

Sequence comparisons failed to reveal significant similar- 
ity between BenC and the reductase components of the 
benzene and toluene dioxygenases that closely resemble 
each other (27, 79). We found two regions of roughly 30 
amino acids in BnzD (P4) and TodA, identical in both 
proteins, which fit the consensus sequence (55, 71) for a 
secondary structure (tap-fold. These regions, which are 
likely to be involved in binding FAD and NAD, start at 
amino acid positions 3 and 145 in the protein sequences of 
BnzD (P4) and TodA. No sequences within BenC or XylZ fit 
this consensus sequence, although regions involved in co- 
factor binding by these proteins may still have pa p- type 
secondary structures. 

The most highly conserved regions in the alignments of 
BenC and XylZ with FAD- and NAD(P)-binding oxidoreduc- 
tases are likely to be those of cofactor binding (Fig. 6 and 8). 
The regions shown in Fig. 8 which are comparable to that of 
BenC near amino acid position 220 have previously been 
suggested to play a role in NADP-ribose binding (18, 46, 50). 
The additional short, highly conserved region depicted in 
Fig. 8 has previously been suggested to be part of a region 
which binds the FAD isoalloxazine ring in spinach ferre- 
doxin-NADP + reductase and cytochrome P-450 reductase 
(30, 50, 72). 

The regions of the ferredoxin reductases underlined in Fig. 
6 may play a role in binding FAD-PP, and NAIXPFPPj 
moieties (50). The underlined ferredoxin reductase se- 
quences and those of BenC and XylZ are similar, with six 
amino acids identical in all aligned sequences and with 27% 
of the BenC amino acid residues identical to those of a 
reductase. Gaps have been introduced in the sequence 
alignments, however, and the role these regions play in 
cofactor binding by BenC or XylZ remains to be determined. 

Although the sequence of the oxidoreductase of the naph- 
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thalene dioxygenase is not presently available, biochemical 
studies suggest that this protein is similar to the benzoate 
and toluate dioxygenase oxidoreductases (16). The sizes of 
these three proteins are similar, and spectroscopic studies of 
the NADH-ferredoxin NAP reductase suggest that it is a flavin 
protein with a [2Fe-2S] chloroplast-type, ferredoxin. This 
chloroplast-type ferredoxin functions in conjunction with the 
separate Rieske-type ferredoxin NAP protein. The naphtha- 
lene dioxygenase system, therefore, appears to be unique in 
having two proteins and three redox centers for the transfer 
of electrons to the terminal hydroxylase. 

The xylene monooxygenase (62) also has an electron 
transport component like those of the benzoate and toluate 
dioxygenases (Fig. 6). The hydroxy lating region of this 
monooxygenase, however, was not found to be similar to the 
hydroxylase components of the dioxygenases which are 
related to each other. It therefore appears that there are at 
least two classes of oxygenase components and at least three 
classes of electron transfer components in multicomponent 
oxygenases. The partnerships of hydroxylase and electron 
transfer components seem to have changed during the 
course of evolution. 
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Comamonas testosteroni T-2 uses a standard, if seldom examined,, attack on an aromatic compound and 
oxygenates the side chain of p-toluenesulfonate (TS) (or /^-toluenecarboxylate) to /?-sulfobenzoate (or tereph- 
thalate) prior to complete oxidation. The expression of the first three catabolic enzymes in the pathway, the TS 
methyl-monooxygenase system (comprising reductase B and oxygenase M; TsaMB), p-sulfobenzyl alcohol 
dehydrogenase (TsaC), and /7-sulfobenzaldehyde dehydrogenase (TsaD), is coregulated as regulatory unit R1 
(H. R. Schtafli Oppenberg, G. Chen, T. Leisinger, and A. M. Cook, Microbiology [Reading] 141:1891-1899, 
1995). The components of the oxygenase system were repurified, and the N-terminal amino acid sequences were 
confirmed and extended. An internal sequence of TsaM was obtained, and the identity of the [2Fe-2S] Rieske 
center was confirmed by electron paramagnetic resonance spectroscopy. We purified both dehydrogenases 
(TsaC and TsaD) and determined their molecular weights and N-terminal amino acid sequences. Oligonucle- 
otides derived from the partial sequences of TsaM were used to identify cloned DNAfrom strain T-2, and about 
6 kb of contiguous cloned DNA was sequenced. Regulatory unit R1 was presumed to represent a four-gene 
operon (tsaMBCD) which was regulated by the LysR-type regulator, TsaR, encoded by a deduced one-gene 
transcriptional unit. The genes for the inducible TS transport system were not at this locus. The oxygenase 
system was confirmed to be a class IA mononuclear iron oxygenase, and class IA can now be seen to have two 
evolutionary groups, the monooxygenases and the dioxygenases, though the divergence is limited to the 
oxygenase components. The alcohol dehydrogenase TsaC was confirmed to belong to the short-chain, zinc- 
independent dehydrogenases, and the aldehyde dehydrogenase TsaD was found to resemble several other 
aldehyde dehydrogenases. The operon and its putative regulator are compared with units of the TOL plasmid. 


Aromatic sulfonates are produced in the multimillion tonne 
range annually and dispersed in the environment as detergents, 
dyestuffs, and additives to products as diverse as ink and engine 
oil (24). These charged compounds are now readily found in 
the environment (see, e.g., reference 38); defined, naturally 
occurring sulfonates, usually aliphatic compounds, are also 
known (64; see also reference 76). At least five different mech- 
anisms for the desulfonation of the aromatic compounds in 
aerobes are known (17, 21, 34, 39, 84; see also reference 16), 
and different organisms can have different strategies to attack, 
e.g.,p-toluenesulfonate as a carbon source (21, 34, 61). There 
is no information on the genetics of these pathways, and in only 
one case is an extensive understanding of the enzymology and 
physiological controls available, the degradation of p-toluene- 
sulfonate by Comamonas testosteroni T-2 (35, 42, 61). 

C. testosteroni T-2, grown with toluenesulfonate as the sole 
source of carbon and energy, synthesizes an inducible, specific, 
secondary proton symport system (42) and inducible enzymes 
that convert toluenesulfonate (or toluenecarboxylate) to sulfo- 
benzoate (or terephthalate) in three steps (Fig. 1) (41). These 
reactions require four enzymes, now termed TsaMB (/?- tolu- 
enesulfonate methyl-monooxygenase system, comprising re- 
ductase B and oxygenase M), TsaC (p-sulfobenzyl alcohol 
dehydrogenase) and TsaD (p-sulfobenzaldehyde dehydroge- 
nase), which are synthesized in one regulatory unit, termed R1 
(61). 
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The pathway (Fig. 1) bears superficial comparison with the 
upper pathway of the TOL plasmid (see, e.g., reference 3) and 
is thus representative of a major mode of attack on aromatic 
compounds: degradation initiated at the alkyl side chain (41). 
The superficiality is immediately visible at the enzyme level. 
Oxygenase TsaMB (strain T-2) has been attributed to the class 
LA multicomponent, mononuclear iron oxygenases (5, 44, 51), 
whereas oxygenase XylMA (TOL plasmid) belongs to a class of 
multicomponent, di-iron monooxygenases (65, 70); the alcohol 
dehydrogenases seem to belong to different classes (41, 66), 
and the same was believed to be true for the aldehyde dehy- 
drogenases (41). 

Whereas it is reasonable to hypothesize that regulatory unit 
R1 (Fig. 1) (61) represents an operon (see, e.g., references 74 
and 83), it is impossible to anticipate the type of regulation 
involved, e.g., LysR-, XylS/AraC-, or XylR-type (33, 53, 59). In 
addition, the (presumptive) genes encoding the inducible 
transport system for toluenesulfonate may be an integral part 
of an operon encoding degradative enzymes (49, 80) located 
dose to the degradative operon (52) or found at another locus 
(48). The transport system for toluenesulfonate does not trans- 
port toluenecarboxylate (42). 

We now report more information, including two enzyme 
purifications (TsaC and TsaD), on the four enzymes of regu- 
latory unit R1 in C. testosteroni T-2 (Fig. 1) (61), and we re- 
port that R1 is indeed an operon comprising tsaMBCD , which 
is presumably under the control of a putative transcriptional 
LysR-type regulator, TsaR. The genes encoding the transport 
system(s) are distant from this structure. This is the first report 
characterizing both the enzymes and the nucleotide sequence 
encoding the attack on the side chain of an aromatic compound. 
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4-Tola enesnlfonate 4-Snlfobenzy! 4-Solfo- 4-Sulfobenzoate 

alcohol benzaldehyde 

R = SOJ or COO' 


FIG. 1. The reactions catalyzed by the monooxygenase system TsaMB, the 
alcohol dehydrogenase TsaC, and the aldehyde dehydrogenase TsaD of regula- 
tory unit R1 (61) of C. testosteroni T-2. 


MATERIALS AND METHODS 

Bacteria, growth conditions and plasmids. C. testosteroni T-2 (DSM 6577) was 
grown in hatch culture at30°C in mineral salts medium, pi I 7.2, containing 6 mM 
loluenesulfonale (40). Most cells for enzyme purifications were grown in a 
12.5-liter fermentor with an 8- to 10-liter working volume (Biostat V; Braun, 
Melsungen, Germany), harvested early (60 mg of protein/liter, to avoid problems 
with foaming and clumping) in a Pellikon cassette filtration system (Milliporc, 
Neu-Isenhurg, Germany), washed by centrifugation, and stored frozen, as in our 
earlier work (39). When vanillate or p-methoxybenzoate (6 mM) was used as the 
carbon source, the pH was adjusted to 7.0. 

Escherichia coli DII5a was used as the host for pBlucscripl-KSH constructs 
and was routinely grown in Luria broth containing 10U p.g of ampicillin per ml at 
37°C. 

Preparation of cell extracts, enzyme assays, and enzyme purifications. Cells 
were disrupted in a French pressure cell, and the crude extract, in 20 mM 
phosphate buffer (pH 7.0) with or without 1 mM dilhiothreilol, was prepared as 
described elsewhere (39). Reductase TsaB was assayed at 25°C by the reduction 
of cytochrome c (39). Monooxygenase TsaM was assayed by oxygen uptake in a 
coupled assay with the reductase (40). Each component (TsaB and TsaM) was 
purified in two chromatographic steps, anion exchange and hydrophobic inter- 
action (40). 

Cell extract for the purification of alcohol dehydrogenase TsaC and aldehyde 
dehydrogenase TsaD was prepared without dithiothreitol. Activities were as- 
sayed at room temperature (21°C) by oxidation (TsaD) or reduction (TsaC) of 
p-carboxybenzaldehyde as described elsewhere (41). TsaC was purified in two 
chromatographic steps (anion exchange and hydrophobic interaction), essentially 
as described elsewhere (41). A new purification scheme for TsaD was developed, 
involving an anion exchange and subsequent gel filtration. The an ion -exchange 
column (Mono Q HR 10/10; Pharmacia, Uppsala, Sweden) was equilibrated with 
20 mM phosphate buffer, pH 6.8 (buffer A), and proteins were eluted with an 
increasing gradient of sodium sulfate (to 1 M, buffer B) at a flow rate of 4 ml/min 
as follows: from 0 to 40 min, 100% buffer A; from 40 to 68 min, 0 to 3% buffer 
B; from 68 to 130 min, 3 to 6% buffer B; from 130 to 200 min, 6 to 20% buffer 
B. Four-milliliter fractions were collected, and fraction 17 was worked up. In the 
second purification step the gel filtration column (Superose 12 HR 10/30; Phar- 
macia) was equilibrated with 150 mM sodium sulfate in 50 mM phosphate buffer, 
pH 6.8, at a flow rate of 0.5 ml/min and a sample volume of 300 p.1 was used. 

Only the fraction of TsaC or TsaD with the highest specific activity was used 
for further work. Purifications were monitored by sodium dodecyl sulfate-poly- 
acrylamide gel electrophoresis (SDS-PAGE) with Coomassie staining (37, 40). 
Purified protein bands were blotted to a polyvinylidenefluoride membrane (Im- 



[g] 

FIG. 2. EPR spectrum of the oxygenase component TsaM of monooxygenase 
system TsaMB from C. testosteroni T-2. 



FIG. 3. SDS-PAGE, with Coomassie blue staining, of the products of puri- 
fication of p-sulfobcnzaldchydc dehydrogenase (TsaD) from C. testosteroni T-2. 
Lane A, molecular weight markers; lane B, crude extract (40 jig of protein); lane 
C, fraction from the anion exchanger (7 p,g of protein); lane D, fraction from the 
gel filtration column (1 p,g of protein). 


mobilon-P; Millipore, Bedford, Mass.) and subjected to Edman degradation (39, 
40). 

Protein samples were concentrated by membrane filtration (Centricon-10; 
Amicon, Danvers, Mass.). The hydrophobic TsaD bound extensively to the filter 
unit unless the unit was first incubated overnight in 5% aqueous Tween 20 
solution and then rinsed thoroughly with water. 

Determinations of the molecular weights under native conditions were done 
with the gel filtration column. The sample volume for molecular weight markers 
(40) and samples was 300 til. 

EPR. Electron paramagnetic resonance (EPR) spectra of monooxygenase 
TsaM (about 2 mg in 0.3 ml) in the X band were recorded with an EPR 300 
spectrometer (Bruker, Rheinstetten, Germany) at 10 K as described elsewhere 
(55). 

Cyanogen bromide cleavage. Purified TsaM was separated on an SDS-12% 
PAGE gel, visualized by the Zn 2+ reverse-staining method (18), and cut from the 
gel. The gel chip was minced, suspended in 500 p,l of 0.1% CNBr, flushed with 
nitrogen, and allowed to react in the dark for 20 h. The reaction mixture was 
incubated for 2 h at 37°C with extraction solution (50% formic acid, 25% 
acetonitrile, 15% propanol, 10% H 2 0) and then centrifuged (12,000 X g, 10 
min). The supernatant fluid was taken to dryness in a vacuum centrifuge. The 
digest was dissolved in water, and protein fragments were separated by reversed - 
phase chromatography (ProRPC HR 10/10; Pharmacia). The column was equil- 
ibrated with 0.1% trifluoroacetic acid (0.5 ml/min), and the sample was eluted 
with an increasing gradient of methanol (0 to 100% in 50 min). The fragments 
eluted in two major peaks. All protein fragments of peak two were pooled and 
dried in a vacuum centrifuge. The pellet was dissolved in loading buffer and 
separated by SDS-PAGE in a 15% gel. Peptides were then stained with Coo- 
massic brilliant blue. Three bands (14, 11.5, and 9.0 kDa) were identified, blotted 
onto a polyvinylidenefluoride membrane, and subjected to Edman degradation 
(see above). The N -terminal amino acids of the 9.0-kDa fragment were used to 
deduce an oligonucleotide probe. 

Enzymes and reagents. All restriction enzymes, Klcnow fragments, T4 DNA 
ligases, and polynucleotide kinases were obtained from Fermenlas, Vilnius, 
Lithuania. [a-^SJdATP for sequencing and [y-^PJdATP for oligonucleotide 
labeling were obtained from Dupont (Regensdorf, Switzerland). 

All reagents for growth media and protein purification were purchased from 
Fluka (Buchs, Switzerland). 

Preparation of total DNA and plasmid DNA Total DNA from a 100-ml batch 
culture (200 mg of protein/liter of culture) of toluenesulfonate-grown C. testo- 
steroni T-2 was prepared by the cetylfrimethylammomum bromide precipitation 
method (4). Plasmid DNA from E . coli was isolated with Nucleobond AX-100 
columns (Macherey-Nagel, Oensingen, Switzerland). 

DNA cloning, screening, sequencing, and analysis. Total DNA from strain T-2 
was digested with restriction enzymes and separated in a 1% agarose gel. The 
DNA was blotted onto a Hybond-N nylon membrane (Amersham International, 
Amersham, United Kingdom). Degenerate oligonucleotide probes were 5’ la- 
beled with [y- 32 P]dATP, and DNA fragments were labeled with [a-^SJdATP by 
random oligonucleotide primers (4). Both probes were used lor screening. DNA 
fragments were cut from the agarose gel and eluted by phenol -chloroform ex- 
traction (4). The eluted fragments were ligated into pBluescript-KSII and intro- 
duced via electroporation (4) into E. coli DH5a. All cloning was done with 
pBluescriptll. 

Nucleotide sequencing of both strands was performed by the modified dideoxy 
chain termination method (57, 63) from double-stranded DNA templates with 
Sequenase (U.S. Biochemical Corp., Cleveland, Ohio), and PCR-hased sequenc- 
ing was performed with fluorescence-labeled nucleotides (Microsynth, Balgach, 
Switzerland). Nucleic acid and amino acid sequences were analyzed by using the 
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TABLE 1. Purification of p-sulfobenzyl alcohol dehydrogenase 
TsaC from C. iestosteroni T-2 


Purification step 

Vol 

(ml) 

Protein Activity 
(pg) (nkat) 

Sp act 
(kat/kg) 

Yield 

(%) 

Purification 

(fold) 

Crude extract 

2 

45,000 

6.5 

0.15 

100 

1 

Anion exchanger 

4 

1,300 

1.8 

1.4 

27 

9.8 

Hydrophobic interaction 

0.5 

20 

0.26 

13.3 

4 

92 


Genetics Computer Group program package (University of Wisconsin, Madison, 
Wis.). The evolutionary tree was generated with the CLUSTAL W program (71). 

Nucleotide sequence accession number. The sequence data are available in the 
NCB1 OenBank library under accession number U 3 2622. 

RESULTS 

Biochemical characterizations and development of oligonu- 
cleotide probes for tsaMB. Both components of oxygenase 
TsaMB were repurified, and the molecular weights of the de- 
natured proteins (43,000 and 36,000) were confirmed (40). The 
N-terminal amino acid sequences were confirmed and ex- 
tended as follows: TsaM, MFIRNXWYVAAWDTEIPAEGL 
FHR; TsaB, ADVPVTVAAVRAVARDVLALELRHANGQ. 
The sequence of TsaM contained a suitable sequence of amino 
acids with low degeneracy of coding. The sequence of TsaB did 
not, and the amino acid composition (39), largely amino acids 
with high degeneracy of coding, implied that no such sequence 
could be expected. We therefore sequenced an internal frag- 
ment of TsaM to be able to derive a second oligonucleotide 
probe: TsaM int , KPGYIHYQANYKU\T>NLLDFTHLAXV 
HPT. 

The flavin content of the reductase component, TsaB, was 
consistent with the yellow color of the protein, and the ability 
of the protein to reduce cytochrome c indicated its [2Fe-2S] 
ferredoxin component (see reference 39). The nature of the 
[2Fe-2S] center in monooxygenase TsaM was presumed from 
its UV spectrum to be a Rieske center. We have now used 
EPR spectroscopy (Fig. 2; Table 4) to confirm the presence of 
the Rieske [2Fe-2S] center (see reference 40). 

Monooxygenase TsaMB has a broad substrate range which 
includesp-methoxybenzoate (40). The monooxygenase systems 
vanillate-demethylase (VanAB) and p-methoxybenzoate-de- 
methylase (MbdAB) reported in the literature (6, 9) are closely 
related to TsaMB (see below), and C. testosteroni T-2 utilized 
both p-methoxybenzoate and vanillate as sole sources of car- 
bon for growth. Extracts of toluenesulfonate-grown cells of 
strain T-2, however, could not oxygenate vanillate, so mono- 
oxygenase system TsaMB is not identical with the vanillate- 
demethylase synthesized by this organism. Similarly, extracts of 
p-methoxybenzoate-grown cells of strain T-2 could not convert 
toluenesulfonate to sulfobenzoate, so, because TsaMB is ex- 
pressed concomitantly with the dehydrogenases yielding the 
latter compound (61), monooxygenase system TsaMB is not 
identical with the p-methoxybenzoate-demethylase in this or- 
ganism. 

Purification of alcohol dehydrogenase TsaC to give identi- 
fication tags. TsaC was purified to apparent homogeneity in a 
two-step procedure which gave a 92-fold purification (Table 1). 
The N terminus was unambiguously determined to be MN 
LNKQVAIV. SDS gel electrophoresis and gel filtration chro- 
matography confirmed earlier data indicative of a dimeric na- 
tive protein with a subunit molecular weight of 29,000 (see 
reference 41). 

Purification of aldehyde dehydrogenase TsaD to give iden- 
tification tags. TsaD was extensively purified (Fig. 3) in a 
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TABLE 2. Purification of p-sulfobenzaldehyde dehydrogenase TsaD 
from C. testosteroni T-2 


Purification step 

Vol 

(ml) 

Protein 

(P6) 

Activity 

(jikat) 

Sp act 
(mkat/kg) 

Yield 

(%) 

Purification 
> (fold) 

Crude extract 

2 

70,700 

140 

2 , 

100 

1 

Anion 

4 

648 

124 

192 

88 

96 

exchanger 
Gel filtration 

0.5 

12.5 

6 

501 

4 

250 


two-step procedure which gave a 250-fold purification (Table 
2). The keys -to this purification were treatment of the concen- 
tration membranes to eliminate extensive protein losses due to 
binding (see Materials and Methods), the avoidance of pooled 
samples which introduced massive impurities, and a brisk rou- 
tine to obtain data before the enzyme lost activity. We thus 
avoided pitfalls which led to separation of the wrong protein 
(41). 

The subunit molecular weight determined by SDS-gel elec- 
trophoresis was 56,000. The protein smeared on gel filtration 
chromatography, presumably due to interaction of the ex- 
tremely hydrophobic protein with the stationary phase, and we 
assume a homodimeric native protein (Table 2). The N-termi- 
nal amino acids were determined to be XSTVLYRXPEL. 

Cloning, sequencing, and identification of the tsaMBCD and 
tsaR genes. We made a restriction digest map of the tsaM locus 
(Fig. 4) with the aid of an oligonucleotide probe representing 
the N terminus of TsaM (as indicated in Fig. 5) and confirmed 
the map with a probe representing the internal amino acid 
sequence (see above). Total DNA was subjected to digestion 
with SaR and Bgl II, and fragments in the 4-kb region, localized 
in a Southern blot with a probe for tsaM, were ligated into 
pBluescript-KSII. Clones (250) were screened with the probe 
for tsaM , and a single clone was identified. Both DNA strands 
of the 4-kb Sad-BgHl insert were partially sequenced. One 
complete (tsaR, see below) and one incomplete open reading 
frame (ORF) were detected. The incomplete ORF included 
the N-terminal and internal sequences from the monooxygen- 
ase component, TsaM. To complete this sequence, a 6-kb Kpnl 
fragment (Fig. 4), which overlaps the 4-kb Sad-BgKl fragment, 
was cloned and partially sequenced. The 6-kb fragment was 
identified with a 0.3-kb Kpnl-Bgill probe derived from the 4-kb 
Sad-Bglll clone. Downstream of tsaM we located a complete 
ORF, whose deduced sequence includes a section identical to 
the N-terminal sequence of reductase TsaB. In total, 6,040 bp 
were sequenced and five ORFs were found and analyzed (Fig. 
4 and 5). 

Sequence and characterization of monooxygenase TsaMB. 
The start codon (ATG) of monooxygenase tsaM was found at 
position 1477 (Fig. 5); the stop codon (TGA) was found at 
position 2518. We located a putative Shine-Dalgamo sequence 
(AAGGAG, position 1464) upstream of this gene (68). The 
start codon of reductase tsaB was found at position 2517, the 
stop codon was found at position 3468, and a putative Shine- 
Dalgarno sequence (CGGGA) was found at nucleotide 2505. 
The stop codon of tsaM (TGA) and the start codon of tsaB 
(ATG) overlap (Fig. 5). Gene tsaM has a coding capacity for 
347 amino acids, giving a calculated molecular weight (A/ rcaIc ) 
of 39,557, and gene tsaB has a coding capacity for 317 amino 
acids (M r cAlc = 34,186); these molecular weights corresponded 
to the values (43,000 and 36,000, respectively) obtained with 
SDS-PAGE. 

A sequence analysis of TsaM confirmed the presence of 
motifs attributed to a Rieske [2Fe-2S] center (e.g., Cys-48, 
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FIG. 4. Restriction digest map of the region containing operon tsaMBCD 
and tsaR. The two clones (SalUBglU and Kpnl) used fur sequencing are shown. 
Only the restriction sites used for subcloning are shown. Distances between two 
restriction sites are marked in base pairs. The orientation and location of tsaM, 
isaB, isaC, tsaD, and tsaR are shown with arrows. Si, Sail; P, Pstl; K, Kpnl; B, 
Bglll; A, Acd; S, Sad. 


Cys-67, His-50, and His-70) and a mononuclear iron center 
(Asp-155, His-158, and His-163) (Fig. 5) (see also references 
44 and 46) and confirmed the enzyme as a member of the class 
I A oxygenases (Table 3). 

Similarly, TsaB was confirmed as a typical class I A reductase 
(Table 3) with presumed sites for proR NADH binding (60) 
(Gly-117, Gly-118, Ile-119, Gly-120, Thr-122, Pro-123, Met- 
127, Arg-145, Pro-195, Phe-232), flavin mononucleotide bind- 
ing (Arg-51, Tyr-53, Ser-54, Leu-55), and a ferredoxin [2Fe-2S] 
center (Cys-266, Cys-271, Cys-274, Cys-304) (Fig. 5; see also 
references 44 and 46). 

Sequencing and characterization of p-sulfobenzyl alcohol 
dehydrogenase TsaC. Downstream of reductase gene tsaB 
(Fig. 4 and 5) we located an ORF (759 bp) termed tsaC . The 
stop codon of reductase tsaB and the start codon of tsaC were 
13 bp apart. The start codon of tsaC (ATG) was found at 
position 3484, the stop codon (TGA) was found at position 
4240, and a potential Shine-Dalgarno sequence (AACCA) was 
found at position 3473. The deduced amino acid sequence 
(Fig. 5) includes the N-terminal sequence determined for the 
p-sulfobenzyl alcohol dehydrogenase (Fig. 1), and the Af rca , c 
(26,438) corresponds to the observed value (29,000); tsaC thus 
encodes the alcohol dehydrogenase. TsaC has a high degree of 
sequence similarity (Table 3) with group II, short-chain, 
NAD(P)-dependent, zinc-independent alcohol dehydroge- 
nases (54) for which a crystal structure is available (25). The 
conserved residues Asp-62 (Fig. 5), Ser-142, Tyr-155, and Lys- 
159 may be involved in the catalytic mechanism, whereas the 
pattern Gly-12-XXX-Gly-16-X-Gly-18 (part of the apa mo- 
tif) and Thr-11 of the N terminus presumably represent con- 
served amino acids in the coenzyme binding domain. These 
data confirm the assumption (41) of the classification of this 
enzyme. 

Sequencing and characterization of p-sulfobenzaldehyde de- 
hydrogenase TsaD. The stop codon of tsaC was found to over- 
lap the start codon (ATG, position 4239) of a 1,427-bp ORF, 
termed tsaD, with a potential Shine-Dalgarno sequence 
(ACGGG) at position 4223 and a stop codon (TGA) at posi- 
tion 5667. The deduced amino acid sequence (Fig. 5) includes 
the N-terminal sequence determined for the aldehyde dehy- 
drogenase (Fig. 1), and M rCitlQ (51,010) corresponds to the 
observed value (56,000); tsaD thus encodes the p-sulfobenzal- 
dehyde dehydrogenase, A data bank analysis showed high lev- 
els of similarity to a wide variety of aldehyde dehydrogenases 
(Table 3). The N-terminal region is poorly conserved, but a 
number of strictly conserved amino acids were found, espe- 
cially Gly residues which are distributed throughout the pro- 
tein. The residues Cys-286 (Fig. 5) and Gly-230 to Gly-235 
(apa motif) may be part of the NAD binding fold and Glu-252 
is thought to be part of the active site (32). However, this 
interpretation conflicts with another (30), and no crystal struc- 
ture is available to resolve the disagreement. 


The region downstream of tsaMBCD. No ORF was found 
downstream (370 bp) of tsaMBCD. A putative transcription 
stop signal [stem-loop and poly(T) sequence] in the region 
from 5688 to 5718 and the functionality of the upstrean) region 
(see below) indicate that this gene cluster represents an 
operon. 

Sequencing of the putative LysR-type transcriptional regu- 
lator, tsaR. At 114 bp upstream of tsaM an ORF, termed tsaR , 
was found in the reverse orientation to tsaMBCD. The putative 
start codon (GTG) of tsaR was attributed to position 1362, the 
stop codon (TGA) was attributed to position 468, and a po- 
tential Shine-Dalgarno sequence (CGGGA) was attributed to 
position 1374. A sequence analysis showed that tsaR would 
encode a protein containing 298 amino acids (M r calc = 32,630) 
with similarities, largely at the N terminus (where the helix- 
turn-helix motif for DNA binding was located), to LysR-type 
transcriptional regulators (Table 3; Fig. 5 [where only highly 
conserved residues of the apa structure are marked]) (59). 

The region downstream of tsaR. No ORF was found down- 
stream (465 bp in the direction of transcription) of tsaR. A 
putative transcription stop signal [stem-loop and poly(T) se- 
quence] in the region from 387 to 407 and the location of the 
tsaMBCD operon indicate that this single gene constitutes a 
separate transcriptional unit. 

Structure of the region between tsaMBCD and tsaR. The 
intergen ic region between tsaR and tsaM, 114 bp, was found to 
have a GC content of 48%, much lower than the 70% observed 
in the coding regions. A low GC content of the intergenic 
region has been observed previously between a LysR-type reg- 
ulator gene and the neighboring gene(s) it regulates (77). One 
putative recognition site for a LysR-type regulator was found, 
position 1376 (TTC-8 bp-GAA), which corresponds to Schell’s 
model (59). A putative sigma™ “housekeeping” promoter of 
operon tsaMBCD is located upstream from tsaM, though the 
Pribnow box TAGCAT at position 1442 and the -35 region 
(1418; TTGTTG) show only weak homologies with the con- 
sensus sequences (45). Another potential promoter, which 
could be recognized by a sigma 70 factor, lies upstream of tsaR: 
the Pribnow box (position 1401; TATAA; reverse strand) is 
identical to the consensus, whereas the -35 region (position 
1424; ACAACA; reverse strand) has poor homology with the 
consensus sequence. 

DISCUSSION 

Each of the genes in the cluster tsaMBCD has been identi- 
fied from the deduced N-terminal amino acid sequence (Fig. 5) 
and by its size. This cluster thus encodes the inducible enzymes 
which catalyze the reactions (Fig. 1) of the physiologically 
defined regulatory unit R1 involved in the degradation of tolu- 
enesulfonate in C. testosieroni T-2 (61). The evidence that the 
cluster represents a four-gene operon lies in the absence of 
neighboring genes (in the direction of transcription (Fig. 5), in 
the putative stop signal (Fig. 5), and in the detection of a 4.5-kb 
mRNA (33a), i.e., a polycistronic messenger of the correct 
length. 

We presume that the putative LysR-type regulatory gene, 
tsaR, is expressed as a one-gene transcriptional unit and that its 
product is involved in both autoregulation and regulation of 
expression of the neighboring tsa catabolic operon. tsaR is 
typical of the supergene family, both from its structure and 
from the nature of the intergenic region with which it would 
have to interact (59), but only experiments will confirm this 
hypothesis. The induction of operons controlled by LysR-type 
regulators takes place through the binding to that protein of a 
substrate or an intermediate of the pathway, so toluenesulfon- 
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FIG. 5. Nucleotide sequences of the toluenesulfonate operon tsaMBCD and 
of its putative regulator gene LsaR. Positions 1 to 1362, reverse strand; positions 
1321 to 1500, double strand; positions 1501 to 6040, sense strand. Putative 
regulator tsaR (positions 1362 to 469, reverse strand), methyl-monooxygenase 
tsaM (positions 1477 to 2517), reductase tsaR (positions 2517 to 3467), alcohol 
dehydrogenase tsaC (positions 3484 to 4239), and aldehyde dehydrogenase tsaO 
(positions 4239 to 5666) are shown. Start codons and stop codons are in boldface 
letters. The chemically sequenced N-terminal and interndl amino acids are un- 
derlined, and the amino acids used to deduce the oligonucleotide probes are 
doubly underlined. Putative transcriptional stop signals (poIy(T) nucleotides and 
stem-loops] are underlined and in boldface. Restriction sites used for subcloning 
are also shown. Putative Shine-Dalgarno sequences are indicated by a lighlface 
asterisk. Selected, conserved amino acids are in boldface with a boldface asterisk. 
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TABLE 3. Comparison of TsaMBCD and TsaR with related proteins 


Protein 


Structure 


Position 

GC content 
(%) 



Related proteins* 



SIT 

Gene 

Name 

% Identity 

otic 

Structure 

GC content 
(%) 

Accession no. 

TsaR 

32,630 (298) 

ND rf 

1374 

1362-469 

70 

Nac 

27.8 

32,754 (305) 

ND 

62 

Q08597 







ClcR 

27.2 

32,527 (294) 

ND 

56 

L06464 







SdsB 

27.1 

32,955 (306) 

ND 

74 

M86744 







CynR 

26.9 

32,961 (299) 

ND 

55 

P27111 







TfdR 

25.9 

32,070 (295) 

ND 

67 

P 10086 

TsaM 

39,557 (347) 

Tetramer 

1464 

1477-2517 

66 

VanA 

32.7 

36,578 (329) 

ND 

69 

P 12609 







Pht3 

23.8 

49,297 (439) 

Tetramer 

59 

D 13229 







CbaA 

23.7 

48,927 (432) 

ND 

59 

U00692 







PobA 

21.0 ' 

46,258 (409) 

ND 

67 

X78823 

TsaB 

34,186 (317) 

Monomer 

2505 

2517-3467 

71 

VanB 

41.2 

33,706(314) 

ND 

73 

PI 2580 







PobB 

38.9 

33,558 (319) 

ND 

73 

X78823 







Pht2 

34.1 

35,999 (324) 

Monomer 

57 

D 13229 







CbaB 

30.9 

31,699 (288) 

ND 

59 

U00692 

TsaC 

26,438 (252) 

Dimer 

3473 

3484-4239 

69 

Xan 

38.9 

26,143 (250) 

ND 

69 

S47054 







BudC 

37.4 

25,313(241) 

Tetramer 

- 63 

Q04520 







3Bhd 

37.4 

26,848 (253) 

ND 

59 . 

P19871 







LinX 

36.8 

25,492 (250) 

ND 

59 

D23722 







Dhkl 

36.1 

28,393 (272) 

ND 

72 

PI 6542 

TsaD 

51,010 (476) 

Dimer? 

4223 

4239-5666 

71 

GabD 

38.7 

51,719(482) 

ND 

58 

P25526 







AldH 

34.7 

54,158 (497) 

ND 

54 

X95396 







Aid 

34.0 

52,142 (478) 

Tetramer 

51 

P25553 







DhaC 

33.7 

54,743 (500) 

Dimer 

ND 

PI 5437 







AldH6 

33.3 

56,009 (512) 

ND 

55 

A55684 


" M r ,c*\v molecular weight calculated from amino acid composition. The total number of amino acids is in parentheses. 

b LysR-type regulators: Nac, nitrogen assimilation control regulator; CynR, cyanate operon regulator; SdsB, sodium dodecyl sulfatase regulator; ClcR, chlorocatechol 
operon regulator; TfdR, 2,4-dichlorophenoxyacetate operon regulator. Oxygenases and reductases: VanAB, Pht23, CbaAB, and PobAB (see legend for Fig. 6). Group 
II dehydrogenases (DH): Xan, hypothetical protein; BudC, acetoin (diacetyl) DH; 3Bhd, 3-p-hydroxysteroid DH; LinX, 7-hexachlorocyclohexane dehydrochlorinase; 
Dhkl, putative ketoacyl reductase (granaticin polyketide synthase). Aldehyde DHs: GabD, succinate-semialdehyde DH; AldH, aldehyde DH; Aid, lactaldehyde DH; 
DhaC, horse cytosol aldehyde DH; AldII6, human cytosol aldehyde Dll. 

* SD, Shine-Dalgamo sequence. 
d ND, not determined. 


ate or toluenecarboxylate or the intermediates in the pathway 
(Fig. 1) could interact with TsaR. The LysR-type autoregula- 
tory transcriptional regulators are widespread and regulate 
many target genes, which include catabolic operons for xeno- 
biotics (13, 75). A subfamily of regulators for chlorocatechol 
degradation has been described (62) that shows homologies to 
TsaR (see Table 3). LysR-type regulators are often plasmid 
encoded and involved in the positive regulation of catabolic 
operons. TfdR (Table 3) is located on plasmid pJP4, which 
belongs to the same incompatibility group, IncPlp, as plasmid 
pTSA (see below) and has a similar GC content. 

The genes we have sequenced (Fig. 5) encode regulatory and 
catalytic functions. We know that transport is essential to the 
metabolism of sulfonates, both theoretically, because these 
charged compounds would otherwise not come in contact with 
the soluble cytoplasmic enzymes catalyzing their degradation, 
and from direct observation of a toluenesulfonate transport 
system in strain T-2 (42). The inducible transport system must 
thus be encoded at another locus, and it will be interesting to 
discover whether TsaR also regulates its expression. The en- 
zymes encoded by the tsa catabolic operon also convert p- 
toluenecarboxylate to terephthalate when the organism utilizes 
toluate (41). It is unclear whether a toluenecarboxylate trans- 
port system is required for growth by strain T-2, but when at 
least porins are required for the metabolism of xylene (XylN) 
and benzene (27a, 79), it seems likely that some membrane 


structure will be required by the organism to utilize toluene- 
carboxylate. 

The GC content of total DNA of C. testosteroni T-2 is 
61.8% ± 0.5%, typical of the species (10). Chromosomally 
located genes in C. testosteroni have a GC content of 61% ± 
1% with 73% for the third base (1, 2, 8, 11, 20, 36, 50). In 
contrast, our ORFs show a high GC content (70%) and an 
extremely high GC content in the third base (91%). The codon 
usage among the genes in both the tsa operon and tsaR is the 
same but different from that of the chromosomal genes. We 
have now shown that the tsa operon and tsaR are located on 
plasmid pTSA, and we conclude that these genes were re- 
cruited from another organism (33b). 

Aerobic metabolism of inert aromatic compounds typically 
starts by oxygenation at the ring or at the (methyl) side chain 
(see reference 41 and citations therein). Although many ring 
oxygenations have been studied, our understanding of oxygen- 
ation of the methyl side chain is limited largely to work related 
to the TOL plasmids, and even there, the oxygenase XylM has 
not been purified (70). The work with C. testosteroni T-2 thus 
presents the first set of data in which all catabolic enzymes 
involved in side chain manipulation have been purified, char- 
acterized, and sequenced. By comparison of the TOL and TSA 
systems, we see clearly that nature has used very different 
detailed solutions for oxidizing a methyl group by the “same” 
pathway. The TOL upper pathway has a different gene order 
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FIG. 6. Rooted evolutionary tree of class. I A multicomponent oxygenases 
with representatives from classes IB, IIB, and III based on the amino add 
composition of the oxygenases. VanA, vanillate demethylase (9); CbaA, 3-chlo- 
robcnzoatc 2,3-dioxygcnasc (46); Pht3, phthalatc 4,5-dioxygcnasc (49); PobA, 
phcnoxybermialc dioxygen asc (15); BenA, benzoate di oxygenase (accession 
number [AC.J P07767); BnzA, benzene di oxygenase (AC. P08084); NdoB, naph- 
thalene dioxygenase (AC. P 23094). The N-terminal amino acid sequence of 
/7-sulfobcnzoatc dioxygcnasc PsbA (another dioxygenase system in the degrada- 
tivc pathway of toluenesuUonale) (39) displays a high sequence identity value of 
48% with CbaA over the available 35-amino-acid sequence, indicating that the 
oxygenases are closely related (46) and that PsbAC is another class I A dioxyge- 
nasc system. p-Mcthoxybcnzoatc demethylase system MbdAB (no sequence data 
available) apparently shares substrates with vanillale-demclhylase system 
VanAB (3-methoxybenzoate and 3,4-dimethoxybenzoate) and monooxygenase 
system TsaMB (4-methylbenzoate and 4-methoxybenzoate), whereas TsaMB 
docs not share any common substrate with VanAB. Wc expect high levels of 
sequence similarity between monooxygenase MbdAB and the other two mono- 
oxygenases. 


than the tsa catabolic operon and includes a transport function 
(XylN); different regulatory systems are used (sigma 54 pro- 
moter, integration host factor, and the NtrC-like regulator 
XylR in the TOL system contrasted with putative sigma' 0 pro- 
moters and LysR-type regulation) and different types of mono- 
oxygenases and different gene families of alcohol dehydroge- 
nases (class I in TOL [67] and class II in tsa [Table 3]) are 
involved. Only the aldehyde dehydrogenases (Table 3) (see the 
introduction) belong to the same family, with 29% sequence 
identity between TsaD and XylC being determined. When two 
types of attack on a methyl side chain differ so markedly (see 
also reference 31), we may expect yet more diversity when 
further pathways of oxidation of aromatic methyl groups are 
examined. 

The short-chain alcohol dehydrogenases have been found 
frequently in aromatic metabolism, but usually as diene-diol 


dehydrogenases (e.g., XylL) (47), which represent a very dif- 
ferent reaction type (regeneration of the aromatic ring struc- 
ture) from that of TsaC (Fig. 1). Sequence comparisons similar 
to those in Table 3 show that the diene-diol dehydrogenases 
have identities with TsaC lower than 32% (Table 3). 

As anticipated from biochemical data (44); monooxygenase 
system TsaMB has a high level of homology to class IA oxy- 
genase systems, the reductases showing a higher level of iden- 
tity than the oxygenases (Table 3). We generated an evolution- 
ary tree for oxygenase components of oxygenase systems with 
an arbitrarily set root (Fig. 6). It was apparent that, despite the 
similarities in class IA, there are two separate evolutionary 
lines in this group of oxygenase components. The subdivision 
of the class LA oxygenase components into monooxygenases 
and dioxygenases, considered with the similarity of the Rieske 
centers (Table 4) (27), presumably indicates differences due to 
the binding site for the aromatic substrate and oxygen. In 
contrast, no subgroups were apparent in trees obtained from 
alignments of sequences from reductase components (not 
shown). It is well known that the domain structure of the 
reductases in class IA differs from those in other classes. Thus, 
we rationalize the homogeneity in the class IA reductases in 
the common property of supplying electrons to the oxygenase 
components. 

Whereas there is considerable understanding of the function 
of the reductases of the mononuclear iron oxygenase systems 
(14, 22, 44) and a steady increase in our understanding of the 
[2Fe-2S] Rieske center in the oxygenase (27, 28, 44), there is 
still very little information on the mononuclear iron itself. The 
long-standing data from Bernhardt's group (7), from which no 
sequence data are available, are only now being expanded (72). 
The close biochemical similarity of monooxygenase TsaMB 
and />-methoxybenzoate demethylase (M n redox centers, 
amino acid composition, and overlapping substrate ranges 
[e.g., 4-methoxybenzoate and 4-methylbenzoate]) (40) serves 
to link the biophysical work (7) with the biochemical and/or 
sequencing work done with several dioxygenase systems (Table 
3). 

The catalytic enzymes (TsaM, TsaC, and TsaD) encoded by 
the tsa operon do not discriminate between the sulfonate and 
the carboxyl substituents, whereas the subsequent enzymes do 
(61). Despite the similarities of both aromatic substituents, it 
seems that their different physical and chemical properties 
sometimes require specialized enzymes. Each class IA oxygen- 
ase requires an aromatic carboxyl or, as in this case, an aro- 
matic sulfonate substituent in its substrate. As monooxygenase 
TsaM can accommodate a sulfonate substituent, the question 
of whether other class IA oxygenases can tolerate a sulfonate 
substituent and still oxygenate the compound arises. 

The high degree of similarity of monooxygenase and dioxy- 
genase systems seen in Fig. 6 (51), in the face of old strictures 


TABLE 4. Data from EPR spectra of the [2Fe-2S] centers in the oxygenase components of some multicomponent oxygenase systems 


Enzyme 

Class 


Sy 

& 

Center 

Source or reference 

4-Toluenesulfonate methyl- mo nooxygenase 

IA 

2.019 

1.918 

1.790 

Rieske 

This paper 

4-Sulfobenzoate DOS° 

IA 

2.025 

1.921 

1.745 

Rieske 

35 

4-Methoxybenzoate demethylase 

IA 

2.008 

1.913 

1.72 

Rieske 

73 

Phthalate DOS 

IA 

2.016 

1.914 

1.763 

Rieske 

12 

4-Chlorophenylacetate DOS 

IA 

2.021 

1.922 

1.737 

Rieske 

43 

2-Halobenzoate DOS 

IB 

2.025 

1.912 

1.788 

Rieske 

19 

Pyrazon DOS 

IIA 

2.02 

1.91 

1.79 

Rieske 

58 

Benzene DOS 

IIB 

2.018 

1.917 

1.754 

Rieske 

23 

Naphthalene DOS 

III 

2.01 

1.91 

1.80 

Rieske 

69 


" DOS, dioxygenase system. 
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from the Enzyme Commission (81), is not restricted to se- 
quence comparisons. Not only can dioxygenase systems mono- 
oxygenate some substrates (26, 78), but a monooxygenase sys- 
tem (MbdAB) can dioxygenate at least one substrate (82). 
When promulgated, the wording of the Enzyme Commission 
rule was quite correct (29). It would seem appropriate to adjust 
the rules of nomenclature to the biochemical and molecular 
biological facts, especially as a monooxygenase system in class 
IB has also been detected (56). 
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Active site structure of Rieske-type proteins: electron nuclear double resonance 
studies of isotopically labeled phthalate dioxygenase from Pseudomonas cepacia 
and Rieske protein from Rhodobacter capsulatus and molecular modeling studies of 
a Rieske center. 

Gurbiel RJ, Doan PE, Gassner GT, Macke TJ, Case DA, Ohnishi T, Fee JA, Ballou 
DP, Hoffman BM. 

Department of Chemistry, Northwestern University, Evanston, Illinois 60208, USA. 

Continuous wave electron nuclear double resonance (CW ENDOR) spectra of 
[delta- 15N,epsilon(-14)N]histidine-labeled phthalate dioxygenase (PDO) from 
Pseudomonas cepacia were recorded and found to be virtually identical to those 
previously recorded from [delta,epsilon-15N2]histidine-labeled protein [Gurbiel, 

R. J., Batie, C. J., Sivaraja, M., True, A. E., Fee, J. A., Hoffman,. B. M., & 

Ballou, D. P. (1989) Biochemistry 28, 4861-4871]. Thus, the two histidine 
residues, previously shown to ligate one of the irons in the cluster [cf, 

Gurbiel et al. 1989)], both coordinate the metal at the N(delta) position of 
their imidazole rings. Pulsed ENDOR studies showed that the "remote", 
noncoordinating nitrogen of the histidine imidazole ring could be observed from 
the Rieske protein in a sample of Rhodobacter capsulatus cytochrome bcl complex 
uniformly labeled with 15N but not in a sample of PDO labeled with 
[delta- 15N, epsilon- 14N]histidine, but this atom was easily observed with a 
sample of Rh. capsulatus cytochrome bcl complex that had been uniformly labeled 
with 15N; this confirmed the conclusion from the CW ENDOR studies that ligation 
is exclusively via N(delta) for both ligands in the PDO center. Modifications in 
the algorithms previously used to simulate 14N ENDOR spectra permitted us to 



compute spectra without any constraints on the relative orientation of hyperfine 
and quadrupole tensors. This new algorithm was used to analyze current and 
previously published spectra, and slightly different values for the N-Fe-N angle 
and imidazole ring rotation angles are presented [cf. Gurbiel et al. (1989) 

Gurbiel, R. J., Ohnishi, T., Robertson, D. E., Daldal, F., and Hoffman, B. M. 

(1991) Biochemistry 30, 11579-11584]. This analysis has permitted us to refine 
the proposed structure of the [2Fe-2S] Rieske-type cluster and rationalize some 
of the properties of these novel centers. Although the spectra of cytochrome bcl 
complex from Rh. capsulatus are of somewhat lower resolution than those obtained 
with samples of PDO, our analysis nevertheless permits the conclusion that the 
geometry of the cluster is essentially the same for all Rieske and Rieske-type 
proteins. Structural constraints inferred from the spectroscopic results 
permitted us to apply the principles of distance geometry to arrive at possible 
three-dimensional models of the active site structure of Rieske protein from Rh. 
capsulatus. Results from this test case indicate that similar procedures should 
be generally useful in metalloprotein systems. We also recorded the pulsed and 
CW ENDOR spectra of 57Fe-labeled PDO, and the resulting data were used to derive 
the full hyperfine tensors for both Fe(IH) and Fe(II) ions, including their 
orientations relative to the g tensor. The A tensor of the ferric ion is 
nominally isotropic, while the A tensor of the ferrous ion is axial, having 
A(parallel) > A(perpendicular); both tensors are coincident with the observed g 
tensor, with A(parallel) of the ferrous ion lying along the maximum g-value, gl. 

These results were examined using refinements of existing theories of 
spin-coupling in [2Fe-2S]+ clusters, and it is concluded that current theories 
are not adequate to fully describe the experimental results. 
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Functional and evolutionary relationships among diverse oxygenases. 

Harayama S, Kok M, Neidle EL. 

Department of Medical Biochemistry, University of Geneva, Switzerland. 

Oxygenases that incorporate one or two atoms of dioxygen into substrates are 
found in many metabolic pathways. In this article, representative oxygenases, 
principally those found in bacterial pathways for the degradation of 
hydrocarbons, are reviewed. Monooxygenases, discussed in this chapter, 
incorporate one hydroxyl group into substrates. In this reaction, two atoms of 
dioxygen are reduced to one hydroxyl group and one H20 molecule by the 
concomitant oxidation of NAD(P)H. Dioxygenases catalyze the incorporation of two 
atoms of dioxygen into substrates. Two types of dioxygenases, aromatic-ring 
dioxygenases and aromatic-ring-cleavage dioxygenases, are discussed. The 
aromatic-ring dioxygenases incorporate two hydroxyl groups into aromatic 
substrates, and cis-diols are formed. This reaction also requires NAD(P)H as an 
electron donor. Aromatic-ring-cleavage dioxygenases incorporate two atoms of 
dioxygen into aromatic substrates, and the aromatic ring is cleaved. This 
reaction does not require an external reductant. All the oxygenases possess a 
cofactor, a transition metal, flavin or pteridine, that interacts with dioxygen. 

The concerted reactions between dioxygen and carbon in organic compounds are 
spin forbidden. The cofactor is used to overcome this restriction. For the 
oxygenases that require the NAD(P)H cofactor, the enzyme reaction is separated 
into two steps, the oxidation of NAD(P)H to generate two reducing equivalents, 
and the hydroxylation of substrates. Flavoprotein hydroxylases that catalyze the 
monohydroxylation of the aromatic ring carry out these two reactions on a single 



polypeptide chain. In other oxygenases, the NAD(P)H oxidation and a 
hydroxylation reaction are catalyzed by two separate polypeptides that are 
linked by a short electron-transport chain. Two reducing equivalents generated 
by the oxidation of NAD(P)H are transferred through the electron-transport chain 
to the co factor on a hydroxylase component that they reduce. Dioxygen couples 
with the reduced co factor and subsequently hydroxylates substrates. The 
electron-transport chains associated with oxygenases contain at least two redox 
centers. The first redox center is usually a flavin, while the second is an 
iron-sulfur cluster. The electron transport is initiated by a single 
two-electron transfer from NAD(P)H to a flavin, followed by two single-electron 
transfers from the flavin to an iron-sulfur cluster. The primary sequences of 
many oxygenases have been determined, and according to their sequence 
similarities, the oxygenases can be grouped into several protein families. Among 
proteins of the same family, the sequences in regions involved in cofactor 
binding are strongly conserved. Local sequence similarities are also observed 
among oxygenases from different families, primarily in regions involved in 
cofactor binding. 
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